(54) Signal converting apparatus and signal converting method

(57) The signal converting apparatus and method predictively produce highly accurate interpolated pixels
in accordance with a classification which precisely reflects a variety of signal characteristics of inputted
video signals to provide a high resolution video signal. An activity is evaluated and classified (12) for each
block of an inputted video signal (S1). Stepwise classifications (p0-p2) (Fig. 5, 22-24) are executed on each
block of the inputted video signal (S1) and a classification (c1) is selected in accordance with an activity code
(c0) obtained as a result of the activity classification (21). In this way, the accuracy of subsequent
classifications can be increased, reflecting the activity characteristic of each block of the inputted video
signal (S1), thus achieving, as a whole, a highly accurate classification of the inputted video signal (S1).
Appropriate prediction coefficients (d1) stored in a ROM (14) are read out based on the activity code (c0) and
the class code (c1) for each block of the inputted video signal (S1). The coefficients are used in calculators
(13) to produce highly accurate interpolated pixel values (d2-d5) which are selected by a selector (15) to
provide a high resolution video signal (S2).
Description

This invention relates to a signal converting apparatus and to a signal converting method. Embodiments of the
invention are applicable to upconvertors for converting standard definition signals (SD), for example NTSC signals or
the like, to high definition signals (HD), for example High Vision or the like.

Heretofore, upconvertors of this type have performed frequency interpolation on SD video signals to increase the number
of pixels in the SD video signals to produce HD video signals. For example, as shown in Fig. 1, such upconvertors
perform double frequency interpolation respectively in the horizontal direction and in the vertical direction on an SD video
signal composed of pixels represented by large "O" marks and large marks on scanning lines 1 of an HD image to
produce an HD video signal composed of pixels represented by small "O" marks and small marks.

As an example of performing interpolation using an upconvertor, there is a method which produces HD pixels at
four different positions from field data of an SD video signal. For example, taking an SD pixel represented by a mark
"◎" into consideration, HD pixels at four different positions, mode 1, mode 2, mode 3 and mode 4, in the vicinity of the SD
pixel "◎" are produced by interpolation of surrounding SD pixels. An intra-space two-dimensional non-separable
filter 2 shown in Fig. 2 and a horizontal/vertical separable filter 3 shown in Fig. 3 are used for this operation as
interpolation filters.

The two-dimensional non-separable filter 2 employs two-dimensional filters 4A to 4D to independently execute
interpolation to generate HD pixels at the four positions mode 1, mode 2, mode 3 and mode 4, and converts the respective
interpolated results into a serial form in a selector 5 to produce an HD video signal.

The horizontal/vertical separable filter 3 executes interpolation for pixels at positions mode 1 and mode 3 with a vertical
interpolation filter 6A and executes interpolation for pixels at positions mode 2 and mode 4 with a vertical interpolation filter
6B to produce data on two scanning lines of an HD video signal. Then, the filter 3 uses horizontal interpolation filters
7A and 7B on the respective scanning lines to interpolate HD pixels at the four positions, and converts the interpolated
results into a serial form in a selector 8 to produce an HD video signal.

Even when the conventional upconvertor described above employs an ideal filter as an interpolation filter, the spatial
definition of the resulting HD video signal remains identical to that of the original SD video signal although the number of
pixels is increased in the HD video signal. Moreover, since an ideal filter cannot be used in practice, the conventional
upconvertor can only produce an HD video signal having a lower definition than that of the original SD video signal.

As a method for solving the problem mentioned above, a classification adaptive processing method has been proposed
which classifies an inputted SD video signal into several classes on the basis of the characteristics thereof and uses
prediction coefficients, comprising prediction data previously generated by learning for each class, to produce an HD
video signal with a high definition. For example, such a method has been proposed by the applicant of this invention in
the specification and drawings of U.S. application Serial No. 08/061,730 filed on May 17, 1993.

However, this classification adaptive processing method has a problem in that the prediction accuracy for the HD
video signal produced thereby is degraded unless an appropriate classification is carried out in accordance with the
characteristics of an inputted SD video signal when prediction coefficients are generated by learning. In other words,
without sufficient classification capability, HD video signals which should essentially be classified into different classes may
be grouped into the same class. Thus, prediction coefficients generated by learning will predict an average value of HD
video signals of different nature, resulting in a degraded definition recovering capability.

One aspect of this invention provides a signal converting apparatus for converting a first inputted video signal into
a second video signal different from the first video signal, comprising: means for evaluating an intra-space activity of the
first video signal and outputting an activity code; means for executing stepwise classifications on the basis of the activity
code and outputting a class code on the basis of the result of the classification; a prediction coefficient memory for
storing prediction coefficients for predictively producing the second video signal by using the first video signal; and means
for performing a prediction calculation on the first inputted video signal by using the prediction coefficient read from the
prediction coefficient memory in accordance with the activity code and/or the class code to produce the second video
signal.

The first video signal may be a lower definition video signal, and the second video signal may be a video signal of
higher definition than the first video signal.

The second video signal may be a video signal that has more pixels than the first video signal.

The means for producing the activity code may evaluate the intra-space activity and activity in a temporal direction 
of the first video signal to output the activity code. 

The means for producing the class code may set a plurality of different pixel patterns for the first video signal, select
a pixel pattern from the plurality of set pixel patterns in accordance with the activity code, and classify the first video
signal by using the selected pixel pattern to output the class code.

In an embodiment of the invention, an intra-space activity of an inputted video signal is evaluated, and stepwise
classifications are executed for the inputted video signal in accordance with the obtained activity code. In this way,
the accuracy of subsequent classifications can be increased, reflecting the intra-space activity characteristic of the
inputted video signal. In addition, since the result of the previous classification is reflected in the stepwise subsequent
classifications, the classification can be executed with a high accuracy.

Prediction coefficients appropriate to the inputted video signal for each block are read based on at least a class 
code thus obtained to produce highly accurate interpolated pixels, thus providing a video signal at a higher resolution. 

Another aspect of the present invention provides a signal converting apparatus for converting a first inputted video
signal into a second video signal different from the first video signal, comprising: means for evaluating the intra-space
activity for the first video signal to output an activity code; means for executing stepwise classifications on the basis of
the activity code to output a class code on the basis of the result of the classifications; and means, including a prediction
value storing memory which stores the prediction value generated as an interpolation pixel signal of the first video
signal, for reading and outputting a prediction value in accordance with the activity code and/or the class code.

A further aspect of the present invention provides a signal converting method for converting an inputted first video
signal into a second video signal different from the first video signal, wherein the intra-space activity of the first video
signal is evaluated to output an activity code, stepwise classifications are executed in accordance with the activity code,
and a class code is outputted in accordance with the result of the classifications. Then, prediction coefficients stored in
a prediction coefficient memory for predictively producing the second video signal by using the first video signal are read
in accordance with the activity code and/or the class code, a prediction calculation is performed on the first inputted
video signal using the read prediction coefficients, and a prediction calculation value is outputted as the second video
signal.

A yet further aspect of the present invention provides a signal converting method for converting an inputted first
video signal into a second video signal different from the first video signal, wherein the intra-space activity of the first
video signal is evaluated to output an activity code, stepwise classifications are executed on the basis of the activity
code, a class code is outputted on the basis of the result of the classifications, a prediction value stored in a prediction
value memory is read in accordance with the activity code and/or the class code, and the prediction value produced as
the interpolation pixel signal of the first video signal is outputted.

Embodiments of the invention seek to provide a signal converting apparatus and signal converting method which
are capable of converting lower definition video signals into higher definition video signals by appropriate classifications
corresponding to a variety of signal characteristics of inputted video signals.

A better understanding of the invention will be obtained from the following detailed description when
read in conjunction with the accompanying drawings, in which:

Fig. 1 is a schematic diagram explaining the relation between an SD video signal and an HD video signal;

Fig. 2 is a block diagram showing a conventional two-dimensional non-separable interpolation filter;

Fig. 3 is a block diagram showing a conventional vertical/horizontal separable interpolation filter;

Fig. 4 is a block diagram showing an upconvertor comprising a two-dimensional non-separable filter according to the present invention;

Fig. 5 is a block diagram showing the configuration of a classification unit shown in Fig. 4;

Fig. 6 is a block diagram showing the configuration of an activity determination unit shown in Fig. 5;

Fig. 7 is a schematic diagram showing an exemplary positioning of SD pixels;

Figs. 8A, 8B and 8C are schematic diagrams showing class tap patterns for the classification unit;

Fig. 9 is a schematic diagram showing the prediction taps of learning data;

Fig. 10 is a flow chart showing a prediction coefficient learning procedure;

Fig. 11 is a schematic diagram explaining a hierarchical structure in a prediction coefficient ROM;

Fig. 12 is a block diagram explaining an activity classification unit in an upconvertor according to a second embodiment;

Fig. 13 is a graph showing a frequency distribution based on the level of ADRC code value;

Fig. 14 is a block diagram explaining an upconvertor according to a sixth embodiment;

Figs. 15A, 15B, 15C, 15D and 15E are schematic diagrams explaining one-dimensional Laplacian filters;

Fig. 16 is a table explaining indexes for a class code of a first-step classification unit;

Figs. 17A, 17B, 17C and 17D are schematic diagrams explaining prediction tap patterns having a characteristic in a horizontal direction as a classification result of the classification unit;

Figs. 18A, 18B, 18C and 18D are schematic diagrams explaining prediction tap patterns having a characteristic in a vertical direction as a classification result of the classification unit;

Figs. 19A, 19B, 19C, 19D, 20A, 20B, 20C and 20D are schematic diagrams explaining prediction tap patterns having a characteristic in an oblique direction as a classification result of the classification unit;

Fig. 21 is a block diagram explaining an upconvertor according to a seventh embodiment;

Fig. 22 is a block diagram explaining a classification unit according to an eighth embodiment;

Figs. 23A, 23B, 23C and 23D are schematic diagrams explaining class tap patterns in classification units;

Figs. 24A, 24B and 24C are schematic diagrams explaining class tap patterns in classification units;

Fig. 25 is a block diagram explaining an upconvertor according to a ninth embodiment;

Fig. 26 is a schematic diagram explaining class generating block data used for generating a class code in a classification unit according to the ninth embodiment;

Fig. 27 is a schematic diagram explaining a hierarchical structure of a class code according to the ninth embodiment;

Fig. 28 is a schematic diagram explaining prediction tap patterns in a prediction calculation unit according to the ninth embodiment;

Fig. 29 is a block diagram showing an upconvertor based on a vertical/horizontal separable filter according to the present invention; and

Fig. 30 is a block diagram showing an upconvertor for generating HD pixel signals based on prediction values according to the present invention.

(1) First Embodiment 

Fig. 4 shows, as a whole, an upconvertor 10 which employs a two-dimensional non-separable filter utilizing classification
adaptive processing to produce an HD video signal from an SD video signal. An SD video signal S1 inputted to the upconvertor
10 through an input terminal IN is supplied to a classification unit 12 block by block, each block being composed of a
predetermined number of pixels centered on a remarked SD pixel, and is also supplied to a prediction calculation unit 13.
The classification unit 12 generates a class code d0 of the remarked SD pixel on the basis of the characteristics of the
SD pixels of the inputted SD video signal S1 in the vicinity of the remarked pixel. The class code d0 is output as address
data to a prediction coefficient ROM (Read Only Memory) 14 which serves as storage means.

The prediction coefficient ROM 14 stores prediction coefficients, which have been previously obtained by learning
and which are used to predictively calculate HD interpolated pixels for producing a high definition video signal from a low
definition video signal, as prediction data d1 corresponding to the class code d0. The prediction coefficient
ROM 14 reads the prediction data d1 using the class code d0 as address data, and supplies it to the prediction
calculation unit 13. The prediction calculation unit 13 executes a predetermined prediction calculation on the SD video signal
S1 using the prediction data d1 to produce HD interpolated pixels from the SD video signal S1. The SD video signal S1
is supplied via a delay unit (not shown) to the prediction calculation unit 13. The delay time of the delay unit
corresponds to the time needed to finish supplying the prediction data d1 to the prediction calculation unit 13.

The prediction calculation unit 13 is composed of four prediction calculators 13A to 13D. The respective prediction
calculators 13A to 13D execute a product sum calculation using the prediction data d1 on the SD video signal S1.
Thereby, the prediction calculators 13A to 13D produce prediction values d2, d3, d4, d5 for HD interpolated pixels
corresponding to pixels at four different positions mode 1, mode 2, mode 3, mode 4 on a scanning line 1, respectively. The
respective HD interpolated pixels d2, d3, d4, d5 produced in the corresponding prediction calculators 13A to 13D are
supplied to a selector 15. The selector 15 rearranges the respective prediction values d2, d3, d4, d5 into time-series
data, using a buffer memory (not shown), which is then outputted from an output terminal OUT as an HD video signal S2.

Fig. 5 shows the configuration of the classification unit 12 shown in Fig. 4. As shown in Fig. 5, the SD video signal
S1 inputted through an input terminal IN is supplied to an activity classification unit 21. Then, in the activity classification
unit 21, for example, the spatial activity is classified for each block composed of 9 pixels of 3 x 3, with a remarked pixel
being centered, to evaluate and determine the characteristics of each block. The activity classification unit 21 produces
a class code c0 on the basis of the classification and the evaluation of the spatial activity to output the class code c0 to
a selector 25 and an ADRC (adaptive dynamic range coding) classification unit 26.

In addition, the SD video signal S1 is supplied in parallel to a wide region tap selector 22, a standard tap selector 23
and a narrow region tap selector 24 for setting three different types of pixel tap patterns. The wide region tap selector
22, the standard tap selector 23 and the narrow region tap selector 24 select tap patterns p0, p1 and p2, respectively,
corresponding to the space classes for the inputted SD video signal S1.

Fig. 6 shows the configuration of the activity classification unit 21 shown in Fig. 5. As shown in Fig. 6, the SD video
signal S1 inputted from the input terminal IN is first supplied to a processing unit 30. Then, the processing unit 30
detects a dynamic range DR in a plurality of the SD pixels centered on the remarked SD pixel of the inputted
SD video signal. The dynamic range DR is defined, for example, using a maximum value MAX and a minimum value
MIN in a neighboring region consisting of nine pixels centered on the remarked pixel (shown by "◎") as
shown in Fig. 7, by the following expression (1):

$$DR = MAX - MIN \tag{1}$$

The dynamic range DR of the plurality of the SD pixels centered on the remarked SD pixel is output to a threshold
determination unit 31. The dynamic range DR is compared with predetermined thresholds in the threshold
determination unit 31. As a result, the threshold determination unit 31 outputs a class code c0 produced by comparing the
dynamic range with the thresholds. In short, the spatial activity is determined by classifying the size of the dynamic
range into three levels (that is, the spatial activity is judged to be one of three steps: high, middle and low) by the threshold
processing in the threshold determination unit 31. Then the determined result is output as a class code c0
represented by two bits. It is generally thought that the spatial activity is high when the dynamic range DR is large, and
that the spatial activity is low when the dynamic range DR is small. In this way, the activity classification unit 21 executes
a first step of a classification based on the dynamic range.
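
The first classification step described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the circuit of Fig. 6: the function names and the two threshold values are hypothetical, since the text does not give the thresholds, and the numbering of the three activity levels is likewise an assumption.

```python
def dynamic_range(block):
    """Return DR = MAX - MIN over a block of pixel levels, as in expression (1)."""
    pixels = [p for row in block for p in row]
    return max(pixels) - min(pixels)


def activity_code(block, low_threshold=16, high_threshold=64):
    """Map the dynamic range of a 3 x 3 block to a two-bit activity code.

    The thresholds are illustrative assumptions; the text does not state them.
    Assumed numbering: 0 = low activity, 1 = middle activity, 2 = high activity.
    """
    dr = dynamic_range(block)
    if dr < low_threshold:
        return 0
    if dr < high_threshold:
        return 1
    return 2


# A nearly flat block and a block containing a strong vertical edge.
flat_block = [[100, 101, 100], [102, 100, 101], [100, 100, 103]]
edge_block = [[20, 20, 200], [20, 20, 200], [20, 20, 200]]
print(activity_code(flat_block), activity_code(edge_block))
```

Three levels fit in the two-bit code mentioned in the text, leaving one code value unused in this sketch.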

Next, the next step of the classification in the wide region tap selector 22, the standard tap selector 23, the narrow
region tap selector 24 and the ADRC classification unit 26 will be described specifically.

First, among the foregoing three types of class tap pattern selectors, the standard tap selector 23 takes into
consideration the standard intra-space variations of the inputted SD video signal S1 and selects an ordinary class tap pattern
as shown in Fig. 8B. In contrast, the wide region tap selector 22 takes into consideration the regular intra-space
variations of the inputted SD video signal S1. That is, the wide region tap selector 22 selects a class tap pattern for a wide
region as shown in Fig. 8A. Further, the narrow region tap selector 24 takes into consideration the irregular intra-space
variations of the inputted SD video signal S1 and selects a class tap pattern for a narrow region as shown in Fig. 8C for
the irregular signal variations.

The wide region tap selector 22, the standard tap selector 23 and the narrow region tap selector 24 respectively
supply the class tap patterns p0, p1 and p2 selected thereby to the selector 25. The selector 25 selects one of
the class tap patterns p0, p1 and p2 in response to a class code c0 sent thereto from the activity classification unit 21
as a selection control signal, and supplies the selected class tap pattern as a class tap pattern p3 to the ADRC
classification unit 26. That is, the selector 25 selects the class tap pattern p0 from the wide region tap selector 22 when the
class code c0 indicates that the spatial activity is low. Conversely, the selector 25 selects the class tap pattern p2
from the narrow region tap selector 24 when the class code c0 indicates that the spatial activity is high.
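
The selection performed by the selector 25 can be sketched as a simple lookup. The tap offsets below are hypothetical placeholders, since the actual patterns of Figs. 8A to 8C are not reproduced in the text. Selecting the standard pattern for middle activity is an assumption as well, because the text states only the low and high cases explicitly. Clamping at the image border is also an illustrative choice.

```python
# Hypothetical tap offsets (row, column) relative to the remarked pixel.
# The real patterns are those of Figs. 8A to 8C, which are not given here.
WIDE_TAPS = [(0, 0), (-2, 0), (2, 0), (0, -2), (0, 2)]
STANDARD_TAPS = [(0, 0), (-1, 0), (1, 0), (0, -1), (0, 1)]
NARROW_TAPS = [(0, 0), (0, -1), (0, 1)]

LOW, MIDDLE, HIGH = 0, 1, 2  # assumed numbering of the activity code c0


def select_class_taps(c0):
    """Return the class tap pattern p3 chosen by the activity code c0."""
    if c0 == LOW:
        return WIDE_TAPS      # low activity: wide region pattern p0
    if c0 == HIGH:
        return NARROW_TAPS    # high activity: narrow region pattern p2
    return STANDARD_TAPS      # middle activity: standard pattern p1 (assumed)


def gather_taps(image, row, col, taps):
    """Read pixel levels at the tap positions, clamping at the image border."""
    height, width = len(image), len(image[0])
    values = []
    for d_row, d_col in taps:
        r = min(max(row + d_row, 0), height - 1)
        c = min(max(col + d_col, 0), width - 1)
        values.append(image[r][c])
    return values


# A 5 x 5 test image whose pixel level encodes its position (10 * row + column).
image = [[10 * r + c for c in range(5)] for r in range(5)]
print(gather_taps(image, 2, 2, select_class_taps(LOW)))
```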

The ADRC classification unit 26 sets the number of re-quantization bits "k" in response to the class code c0 which
is used as a control signal. The class code c0 has been generated in accordance with the dynamic range DR of the
inputted SD video signal S1. In this way, the level resolution capability can be set differently for each tap in the tap
pattern p3 selected for a space class, depending upon the dynamic range DR of the SD video signal S1.

The ADRC re-quantizes pixels with a quantization step size determined by the dynamic range DR and the number of
re-quantization bits. An ADRC code c1 ("c_i" is used in the following expression in accordance with the number "i" of
the SD pixel in the class tap pattern) is represented, using the dynamic range DR, the number of re-quantization bits "k",
an SD pixel x_i, and a minimum pixel level MIN in its neighboring region, by the following expression (2):

$$c_i = \frac{x_i - MIN}{DR / 2^k} \tag{2}$$

The level resolution capability for a tap pattern of a space class is changed by changing the number
of re-quantization bits "k" in the ADRC calculation represented by expression (2) in accordance with the class code c0.
In this way, the level resolution capability can be adaptively changed and set in accordance with the dynamic range DR
of an inputted signal. That is, the larger the dynamic range DR becomes, the more finely the ADRC classification unit 26
sets the level resolution capability.
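
The adaptive re-quantization can be sketched as follows. The sketch assumes the usual ADRC form, in which each tap is offset by the local minimum and divided by a step of DR / 2^k, with the result truncated to an integer. The clipping of the top code to 2^k - 1, the handling of a flat block (DR = 0), and the mapping from activity code to "k" are all assumptions added for illustration and are not stated in the text.

```python
def adrc_codes(taps, k):
    """Re-quantize each tap level to k bits relative to the local minimum.

    Assumes c_i = (x_i - MIN) / (DR / 2^k), truncated to an integer.
    Clipping to 2^k - 1 and the DR == 0 case are assumptions of this sketch.
    """
    minimum, maximum = min(taps), max(taps)
    dr = maximum - minimum
    if dr == 0:
        return [0] * len(taps)
    levels = 1 << k
    return [min((x - minimum) * levels // dr, levels - 1) for x in taps]


def bits_for_activity(c0):
    """Hypothetical mapping from the activity code c0 to the bit count k.

    Only the direction (more bits for a larger dynamic range) follows the text.
    """
    return {0: 1, 1: 2, 2: 3}[c0]


taps = [10, 20, 30, 40]
print(adrc_codes(taps, 1), adrc_codes(taps, 2))
```

With one bit the four taps collapse to two levels, while two bits separate all four, which illustrates how a larger "k" gives a finer level resolution for the same tap pattern.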

The classification unit 12 thus generates a class code d0 composed of the class code c0 and an ADRC code c1.
The class code d0 is supplied to the prediction coefficient ROM 14 at a subsequent stage as address data.
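
One way to combine the two codes into a single address is sketched below. The bit layout is hypothetical: the text states only that the class code is composed of the activity class code and the ADRC code, and is used as address data. Placing the two-bit activity code in the lowest bits keeps addresses distinct even when the number of taps and the number of re-quantization bits differ between activity levels.

```python
def pack_class_code(c0, adrc, k):
    """Combine the two-bit activity code c0 with the k-bit ADRC codes.

    Hypothetical layout: the ADRC codes are concatenated into c1, and c0
    occupies the two least significant bits of the resulting address.
    """
    c1 = 0
    for code in adrc:
        c1 = (c1 << k) | code
    return (c1 << 2) | c0


def unpack_class_code(d0):
    """Split an address back into (c0, c1) under the same assumed layout."""
    return d0 & 0b11, d0 >> 2


d0 = pack_class_code(2, [0, 1, 2, 3], 2)
print(d0, unpack_class_code(d0))
```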

The prediction coefficient ROM 14 reads the class code d0 composed of a combination of the class code c0 and
the ADRC code c1 as address data, and supplies prediction data d1 to be used to produce HD interpolated pixels to
the prediction calculation unit 13. The respective prediction calculators 13A to 13D execute a prediction calculation
using SD pixels x_i comprising the SD video signal S1 and prediction coefficients w_i comprising prediction data d1 for
each class to produce predicted pixels y' for HD interpolated pixels corresponding to the positions mode 1 to mode 4
on the scanning line 1.

The SD pixels x_i used in this event are formed, for example, of thirteen prediction tap data comprising a remarked
pixel (indicated by "◎") and surrounding pixels (indicated by "O") positioned as shown in Fig. 9. Therefore, in this
case, the prediction coefficients w_i comprise thirteen prediction coefficients for each of the prediction calculators.
Moreover, the SD pixels used in the respective prediction calculators 13A to 13D are identical with one
another, but the prediction coefficients w_i from the prediction coefficient ROM 14 are different in the respective
prediction calculators 13A to 13D, so that the prediction coefficient ROM 14 stores four groups of prediction
coefficients, each comprising thirteen prediction coefficients, corresponding to one class.

The predicted pixels y' for the HD interpolated pixels are transformed and produced, using the foregoing thirteen
SD pixels x_i and the prediction coefficients w_i, by the following expression (3):

$$y' = \sum_{i=1}^{13} w_i x_i \tag{3}$$



The respective prediction calculators 13A to 13D execute the prediction calculation by the expression (3) using
the SD pixels and the prediction coefficients respectively supplied, and produce the HD interpolated pixels.
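
The product sum calculation performed by the four prediction calculators can be sketched as below. The function names and the coefficient values in the usage example are illustrative assumptions; in the apparatus the coefficients are those read from the prediction coefficient ROM for the class of the block.

```python
def predict_pixel(sd_taps, coefficients):
    """Product sum of prediction taps and coefficients (one HD pixel)."""
    if len(sd_taps) != len(coefficients):
        raise ValueError("tap and coefficient counts must match")
    return sum(w * x for w, x in zip(coefficients, sd_taps))


def predict_four_modes(sd_taps, coefficient_groups):
    """Apply the same taps to four coefficient groups (mode 1 to mode 4)."""
    return [predict_pixel(sd_taps, group) for group in coefficient_groups]


# Thirteen prediction taps and four made-up coefficient groups for one class.
sd_taps = list(range(1, 14))
coefficient_groups = [
    [1] * 13,          # sums all taps
    [0] * 12 + [2],    # doubles the last tap
    [1] + [0] * 12,    # copies the first tap
    [0] * 13,          # yields zero
]
print(predict_four_modes(sd_taps, coefficient_groups))
```

The same thirteen taps feed all four calculations; only the coefficient group differs, which is why four groups are stored per class.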

The prediction coefficients w_i used herein have been previously generated by learning and stored in the prediction
coefficient ROM 14.

Next, a learning procedure for generating prediction coefficients for each class stored in the prediction coefficient
ROM 14 will be described by referring to the flow chart shown in Fig. 10.

The prediction coefficients are generated in accordance with a prediction coefficient learning procedure shown in
Fig. 10. Upon starting the prediction coefficient learning procedure at step SP1, learning data corresponding to
previously known images are first generated at step SP2 for learning prediction coefficients w_i.

Specifically, in the HD image shown in Fig. 1, an HD interpolated pixel is designated as an HD remarked pixel, and
this HD remarked pixel is expressed by a linear primary combination model using prediction coefficients by a set of
learning data comprising surrounding HD interpolated pixels and SD pixels. The prediction coefficients used in this
event are calculated using a least squares method for each class. In addition, in generating learning data as described
above, if a plurality of images are used, instead of a single image, to generate a multiplicity of learning data, more
accurate prediction coefficients can be generated.

It is determined at step SP3 whether or not a sufficient number of learning data have been generated at step SP2
for obtaining the prediction coefficients. If it is determined that the number of generated learning data is less than a
required number, the prediction coefficient learning procedure proceeds to step SP4.

At step SP4, class learning data are classified. The classification is performed in such a manner that a local
flatness is first detected for learning sampling data, and pixels used for the classification are selected in accordance with
the detection results. In this way, pixels exhibiting small changes of the input signal are removed from data to be
learned, so that the influence of noise can be eliminated. The classification of the class learning data is carried out by
executing the same processing as that used for classifying the inputted SD video signal S1.

More specifically, the classification of the class learning data begins with the classification and evaluation of the
dynamic range DR of learning data to set a class code c0. Subsequently, a tap pattern p3 is selected from three kinds
of wide region, standard and narrow region tap patterns based on the class code c0 as a space class. Then, as shown
in Fig. 11, the class code c0 thus generated is combined with an ADRC code c1 to set a class code d0, and this class
code d0 is stored in ROM in correspondence to prediction data d1.

Subsequently, the prediction coefficient learning procedure forms at step SP5 a normalization equation for each 
class based on the classified learning data. 

The processing at step SP5 will be specifically explained. For generalization, described below is a case
where "n" sampling pixels exist as learning data. First, the relationship between pixel levels x_1, ..., x_n of respective
sampling pixels and a pixel level "y" before subsampling of a remarked interpolated pixel is expressed for each class
by a prediction expression represented by a linear primary combination model using "n" taps of prediction coefficients
w_1, ..., w_n. The prediction expression is given by the following expression (4):

$$y = \sum_{i=1}^{n} w_i x_i \tag{4}$$

The prediction coefficients w_1, ..., w_n in the expression (4) are calculated to predict the pixel level "y".

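Next, an example will be given for showing how to generate the prediction coefficients w_1, ..., w_n by a least squares
method. The least squares method is applied as follows.

As a generalized example, the following observation expression (5) is considered where "X" represents a set of input
data, "W" a set of prediction coefficients, and "Y" a set of predicted values.

$$XW = Y \tag{5}$$

where

$$X = \begin{pmatrix} x_{11} & x_{12} & \cdots & x_{1n} \\ x_{21} & x_{22} & \cdots & x_{2n} \\ \vdots & \vdots & & \vdots \\ x_{m1} & x_{m2} & \cdots & x_{mn} \end{pmatrix}, \quad W = \begin{pmatrix} w_1 \\ w_2 \\ \vdots \\ w_n \end{pmatrix}, \quad Y = \begin{pmatrix} y_1 \\ y_2 \\ \vdots \\ y_m \end{pmatrix}$$
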
The least squares method is applied to data collected by the observation expression given by expression (5). In the
example given by expression (5), "n" is equal to "13", and "m" represents the number of learning data.

First, based on the observation expression of the expression (5), the following residual expression (6) is considered:

$$XW = Y + E, \quad \text{where} \quad E = \begin{pmatrix} e_1 \\ e_2 \\ \vdots \\ e_m \end{pmatrix} \tag{6}$$



It can be thought from the residual expression given by the expression (6) that the most probable value of each w_i
is derived when a condition for minimizing the value of the following expression (7) is satisfied.

$$e_1^2 + e_2^2 + \cdots + e_m^2 \tag{7}$$



More specifically, when the partial differentiation of expression (7) with respect to w_i is expressed by the following
expression (8):

$$e_1 \frac{\partial e_1}{\partial w_i} + e_2 \frac{\partial e_2}{\partial w_i} + \cdots + e_m \frac{\partial e_m}{\partial w_i} = 0 \quad (i = 1, 2, \ldots, n) \tag{8}$$

"n" conditions are considered based on "i" in the expression (8), and w_1, w_2, ..., w_n satisfying these
conditions may be calculated. Thus, the following expression (9) is derived from the residual expression (6).



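$$\frac{\partial e_j}{\partial w_1} = x_{j1}, \quad \frac{\partial e_j}{\partial w_2} = x_{j2}, \quad \ldots, \quad \frac{\partial e_j}{\partial w_n} = x_{jn} \quad (j = 1, 2, \ldots, m) \tag{9}$$
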



From the expression (9) and the expression (8), the following expression (10) is derived:

$$\sum_{j=1}^{m} e_j x_{j1} = 0, \quad \sum_{j=1}^{m} e_j x_{j2} = 0, \quad \ldots, \quad \sum_{j=1}^{m} e_j x_{jn} = 0 \tag{10}$$



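Then, from the expression (6) and the expression (10), the following normalization expression (11) is derived:

$$\begin{aligned}
\Big(\sum_{j=1}^{m} x_{j1} x_{j1}\Big) w_1 + \Big(\sum_{j=1}^{m} x_{j1} x_{j2}\Big) w_2 + \cdots + \Big(\sum_{j=1}^{m} x_{j1} x_{jn}\Big) w_n &= \sum_{j=1}^{m} x_{j1} y_j \\
\Big(\sum_{j=1}^{m} x_{j2} x_{j1}\Big) w_1 + \Big(\sum_{j=1}^{m} x_{j2} x_{j2}\Big) w_2 + \cdots + \Big(\sum_{j=1}^{m} x_{j2} x_{jn}\Big) w_n &= \sum_{j=1}^{m} x_{j2} y_j \\
&\;\;\vdots \\
\Big(\sum_{j=1}^{m} x_{jn} x_{j1}\Big) w_1 + \Big(\sum_{j=1}^{m} x_{jn} x_{j2}\Big) w_2 + \cdots + \Big(\sum_{j=1}^{m} x_{jn} x_{jn}\Big) w_n &= \sum_{j=1}^{m} x_{jn} y_j
\end{aligned} \tag{11}$$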



Since as many normalization equations as the number "n" of unknowns can be formed from the expression (11),
the most probable value of each w_i can be calculated from these normalization expressions.

The normalization expressions can be solved using a sweeping-out method (Gauss-Jordan's elimination method). 
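
A sweeping-out solution of such a system can be sketched as follows. This is an illustrative Gauss-Jordan routine in plain Python, not the procedure of any particular embodiment. The row exchange (partial pivoting) and the singularity tolerance are additions made here for numerical robustness and are not mentioned in the text.

```python
def gauss_jordan_solve(a, b):
    """Solve a w = b for w by Gauss-Jordan elimination.

    a is an n x n list of lists and b a list of n values. Partial pivoting
    and the 1e-12 singularity tolerance are assumptions of this sketch.
    """
    n = len(b)
    # Build the augmented matrix [a | b] as floats; the inputs are not modified.
    aug = [[float(v) for v in row] + [float(b[i])] for i, row in enumerate(a)]
    for col in range(n):
        # Pick the row with the largest pivot candidate in this column.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("singular system: more learning data are needed")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        # Normalize the pivot row, then sweep the column out of all other rows.
        pivot_value = aug[col][col]
        aug[col] = [v / pivot_value for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [aug[i][n] for i in range(n)]


# Two unknowns: 2 w1 + w2 = 5 and w1 + 3 w2 = 10.
solution = gauss_jordan_solve([[2, 1], [1, 3]], [5, 10])
print(solution)
```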
The prediction coefficient learning procedure repeats a loop of steps SP2-SP3-SP4-SP5-SP2 until the same
number of the normalization expressions as the number "n" of unknowns are formed for calculating the undetermined
coefficients w_1, ..., w_n for each class.

When the required number of normalization expressions are thus formed, an affirmative result is derived for the
determination at step SP3 as to whether or not the learning data have been exhausted, and the procedure proceeds
to a determination of prediction coefficients at step SP6.

At step SP6, the normalization expressions given by the expression (11) are solved to determine the prediction
coefficients w1, ..., wn for each class. The prediction coefficients thus determined are stored at the next step SP7 in a
storage means such as a ROM whose storage area is divided for each class. In this case, four groups of prediction
coefficients, each comprising the prediction coefficients w1, ..., wn and corresponding respectively to the prediction
calculators 13A to 13D, are stored for one class code. By the foregoing learning procedure, the prediction coefficients
for the classification predictive processing are generated, and the prediction coefficient learning procedure
terminates at the next step SP8.

Next, the operation of the aforementioned upconvertor 10 of the first embodiment and of each unit of the upconvertor
will be described. An SD video signal S1 inputted to the upconvertor 10 through the input terminal IN is first supplied
in parallel to the classification unit 12 and the prediction coefficient calculation unit 13. The classification unit 12
generates a class code d0 based on the SD video signal S1 and supplies the generated class code d0 to the prediction
coefficient ROM 14. The prediction coefficient ROM 14 reads out prediction data d1, previously obtained by learning, in
accordance with the class code d0, and supplies it to the prediction coefficient calculation unit 13. In the respective
prediction calculators 13A to 13D, the prediction coefficient calculation unit 13 produces HD interpolated pixels
corresponding to four positions (mode 1 to mode 4) on a scanning line 1, based on the SD video signal S1 sent from the
input terminal IN and the prediction data d1 supplied from the prediction coefficient ROM 14.

In the classification unit 12, the activity classification unit 21 first detects a dynamic range DR of a plurality of SD
pixels centered on the remarked SD pixel of the inputted SD video signal S1, and compares the dynamic range
DR with a predetermined threshold to output a class code c0. Generally, the larger the dynamic range, the higher the
spatial activity; conversely, the smaller the dynamic range, the lower the spatial activity.
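
The dynamic range test can be sketched in a few lines of Python. The sketch assumes a single threshold and a one-bit result (0 for low activity, 1 for high activity); the threshold value and the function name are hypothetical, since the description says only that DR is compared with a predetermined threshold.

```python
def activity_code_from_dynamic_range(block, threshold):
    """Return an assumed one-bit class code c0 from the dynamic range of a pixel block."""
    dynamic_range = max(block) - min(block)  # DR = MAX - MIN over the block
    return 1 if dynamic_range > threshold else 0


# Hypothetical 3x3 blocks flattened to nine pixel values.
flat_block = [100, 102, 101, 99, 100, 103, 100, 101, 100]
edge_block = [20, 22, 200, 21, 23, 198, 20, 24, 201]
c0_flat = activity_code_from_dynamic_range(flat_block, threshold=16)
c0_edge = activity_code_from_dynamic_range(edge_block, threshold=16)
```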

Meanwhile, the inputted SD video signal S1 in block units is supplied in parallel to the wide region tap selector 22,
the standard tap selector 23 and the narrow region tap selector 24, which set three different pixel tap patterns
p0, p1 and p2 for the respective space classes.

The selector 25, based on the class code c0, selects a class tap pattern p0 covering a signal change over a relatively
wide range, as shown in Fig. 8A, for an SD video signal S1 having a small dynamic range DR and a low activity, so that
a slow signal change is reflected in the class. On the other hand, the selector 25 selects a class tap pattern p2 covering
a signal change over a narrow region, as shown in Fig. 8C, for an SD video signal S1 having a large dynamic range DR
and a high activity, so that a signal change in a narrow region is expressed with the largest possible number of classes.
In this way, depending on the signal characteristics in terms of the dynamic range DR, the selector 25 selects a space
class represented by a tap pattern p3 reflecting the signal change of the associated SD video signal S1, and supplies it
to the ADRC classification unit 26 at the next stage.

The ADRC classification unit 26, using the class code c0 as a control signal, sets a small value for the number of
requantization bits "k" of each tap for the space classification for an SD video signal S1 having a small dynamic range
DR. This reduces the level resolution capability of each tap, so that an ADRC code c1 is output on the assumption
that the SD video signal S1 is stable. On the other hand, the ADRC classification unit 26 sets a larger value for the
number of requantization bits "k" of each tap for the space classification for an SD video signal S1 having a large
dynamic range DR, in order to output an ADRC code c1 with a higher level resolution capability. In this way, an unstable
signal change of the SD video signal S1 having a large dynamic range DR and a high activity can be reflected in the
class.

As described above, the classification unit 12 changes the tap pattern of pixels used for the classification in
accordance with the dynamic range DR of an inputted SD video signal S1, and also changes the number of re-quantization
bits "k" of each tap for the classification to adaptively set the level resolution capability. This can provide an appropriate
classification in accordance with the characteristics of the dynamic range of the inputted SD video signal S1.

The classification unit 12 combines the class code c0 with the ADRC code c1 to generate a class code d0, which is
supplied to the prediction coefficient ROM 14 at the next stage. In the prediction coefficient ROM 14, prediction data d1
is read based on the class code d0 and supplied to the prediction calculation unit 13. The prediction calculation unit 13
produces HD interpolated pixels by transforming SD pixels into HD interpolated pixels using the prediction data d1.
The HD interpolated pixels are supplied to the selector 15, rearranged in time series at the selector 15, and output as
an HD video signal. Thus, the selected prediction data d1 reflects the characteristics of the inputted SD video signal S1 in
terms of the dynamic range DR, thereby making it possible to improve the accuracy of the HD interpolated pixels produced
by transforming SD pixels, and to improve the spatial resolution capability of the HD video signal S2.
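
The flow from class code to HD interpolated pixels can be sketched as follows. This is only an illustration under stated assumptions: the bit concatenation used to form d0, the dictionary standing in for the prediction coefficient ROM 14, the weighted-sum form of the prediction, and all names are hypothetical; the description states only that c0 and c1 are combined and that four coefficient groups are stored per class code.

```python
def combine_class_code(c0, c1, c1_bits):
    """Assumed layout: c0 occupies the bits above the c1_bits-wide ADRC code c1."""
    return (c0 << c1_bits) | c1


def predict_hd_pixels(sd_taps, coefficient_groups):
    """One weighted sum of the SD taps per coefficient group (modes 1 to 4)."""
    return [sum(w * x for w, x in zip(group, sd_taps)) for group in coefficient_groups]


# Hypothetical ROM contents: class code -> four coefficient groups (13A to 13D).
rom = {
    5: [[1, 0, 0], [0, 1, 0], [0, 0, 1], [0.5, 0.5, 0]],
}
d0 = combine_class_code(c0=1, c1=1, c1_bits=2)
hd_pixels = predict_hd_pixels([10, 20, 30], rom[d0])
```

The four results would then be interleaved in time series, which is the role the description assigns to the selector 15.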

According to the foregoing embodiment, an SD video signal S1 inputted to the upconvertor 10 undergoes a
determination in the activity classification unit 21 as to whether its dynamic range DR is larger or smaller than a threshold
value. Then, a tap pattern suitable for the characteristics of the SD video signal S1 in terms of the dynamic range DR
can be set from tap patterns for three kinds of space classes (a wide region tap pattern, a standard region tap pattern,
or a narrow region tap pattern), based on a class code d0 generated as the result of the determination. It is therefore
possible to set a class tap pattern which reflects the characteristics of the inputted SD video signal S1 in terms of the
dynamic range DR.

Also, according to the foregoing embodiment, the number of re-quantization bits "k" of each tap for the space
classification is changed in accordance with a class code c1 to change the level resolution capability of each tap, thereby
making it possible to reflect stable or unstable signal changes in the classification by use of the level resolution capability
of the tap. In this way, the inputted SD video signal S1 is appropriately classified with a tap pattern and a level resolution
capability of the tap set in accordance with the dynamic range DR of the inputted SD video signal S1, making it possible
to produce an HD video signal S2 having a high spatial resolution which reflects the signal characteristics of the inputted
SD video signal S1.

(2) Second Embodiment 


Fig. 12 shows an activity classification unit 35 of the upconvertor according to a second embodiment. The activity
classification unit 35 evaluates the intra-space activity of an SD video signal S1, inputted thereto in block units and
comprising a plurality of SD pixels centered on a remarked pixel, and classifies the remarked pixel in accordance with
its characteristics.

The SD video signal S1 inputted from an input terminal IN is supplied to the ADRC classification unit 36, which executes
a classification based on ADRC for the SD video signal S1 comprising a plurality of SD pixels centered on the remarked
pixel.

An ADRC code "c" (it should be noted that ci is used for the ADRC code c in the expression (12) in order to conform
to the number i of SD pixels in a class tap pattern) outputted from the ADRC classification unit 36 is generated from a
dynamic range DR, a number of re-quantization bits "k", an SD pixel xi, and a minimum pixel level MIN within a
neighboring region of the SD pixel, as expressed by the following expression (12):

$$c_i = \frac{x_i - \mathrm{MIN}}{DR / 2^k} \qquad (12)$$

similarly to the first embodiment.
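
A minimal Python sketch of ADRC requantization follows, assuming that expression (12) has the form ci = (xi - MIN) / (DR / 2^k) with the result truncated to an integer. Clamping the maximum pixel to the top code 2^k - 1 and returning all zeros for a completely flat block are illustrative assumptions, as is the function name.

```python
def adrc_codes(taps, k):
    """Requantize each tap to k bits relative to the block minimum and dynamic range."""
    minimum = min(taps)
    dynamic_range = max(taps) - minimum
    if dynamic_range == 0:
        return [0] * len(taps)  # assumption: a flat block maps to code 0
    step = dynamic_range / (2 ** k)
    top = 2 ** k - 1
    # Truncate to an integer level; the block maximum is clamped to the top code.
    return [min(int((x - minimum) / step), top) for x in taps]


codes = adrc_codes([10, 20, 30, 50], k=2)
```

With a smaller k the same taps collapse onto fewer levels, which is the reduction in level resolution capability that the description associates with a small dynamic range.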

The ADRC code "c" generated in the ADRC classification unit 36 is supplied to a post-processing unit 37 at the next
stage. The post-processing unit 37 calculates a value representing the degree of variation of the level distribution
pattern indicated by the ADRC code "c", for example a standard deviation σ of the ADRC code "c", and supplies the
calculated standard deviation σ to a threshold determination unit 38 at the next stage.
The threshold determination unit 38 generates and outputs a class code c0 by comparing the standard deviation σ of
the ADRC code "c" with a threshold for determination. The standard deviation σ of the ADRC code "c" is expressed by
the following expression (13) using the ADRC code "ci", an average value "ca" of the ADRC code "ci", and the number
"n" of ADRC codes:

$$\sigma = \sqrt{\frac{\sum_{i=1}^{n} (c_i - c_a)^2}{n}} \qquad (13)$$



The selector 25 shown in Fig. 5 uses the class code c0 thus generated to select a tap pattern p3 for a space class,
similarly to the first embodiment. Also, the ADRC classification unit 26 adaptively sets a level resolution of each tap
based on the class code c0. In this way, the tap pattern and level resolution are set for the space class based on the
result of classifying the spatial activity of the inputted SD video signal, thus producing similar effects to the first
embodiment.
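
The standard deviation test of this embodiment can be sketched as below. The sketch assumes the population form of the standard deviation (division by n), a single threshold, and a one-bit class code; the threshold value and the function names are hypothetical.

```python
import math


def standard_deviation(values):
    """Population standard deviation: sqrt(sum((v - mean)^2) / n)."""
    n = len(values)
    mean = sum(values) / n
    return math.sqrt(sum((v - mean) ** 2 for v in values) / n)


def activity_code_from_deviation(values, threshold):
    """Assumed mapping: 1 (high activity) when the deviation exceeds the threshold."""
    return 1 if standard_deviation(values) > threshold else 0


sigma = standard_deviation([2, 4, 4, 4, 5, 5, 7, 9])
c0 = activity_code_from_deviation([2, 4, 4, 4, 5, 5, 7, 9], threshold=1.5)
```

The same helper would accept either ADRC codes or raw pixel values as its input list.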

(3) Third Embodiment

As a third embodiment, for the classification, the standard deviation σ may be calculated, for example, in view of the data
distribution of nine pixels around a remarked pixel of an inputted SD video signal S1 shown in Fig. 7, and a
class code c0 may be generated by a threshold determination on the calculated standard deviation σ.

In other words, in the processor 30 of the activity classification unit 21 shown in Fig. 6, the standard deviation σ is
calculated in view of the data distribution of nine pixels around the inputted remarked pixel.

In short, the SD video signal S1 inputted from the input terminal IN is supplied to the processor 30, which calculates
the standard deviation σ in view of the data distribution of nine pixels around the inputted remarked pixel. The processor 30
supplies the calculated standard deviation σ to the threshold determination unit 31 at the next stage. The threshold
determination unit 31 generates and outputs a class code c0 by a threshold determination of the standard deviation σ. The
standard deviation σ is expressed by the following expression (14) using the SD pixels xi, an average value xa in a
neighboring region, and the number of pixels "n" in the neighboring region:

$$\sigma = \sqrt{\frac{\sum_{i=1}^{n} (x_i - x_a)^2}{n}} \qquad (14)$$

In this way, in the third embodiment, the classification is executed by a threshold determination using this standard
deviation σ. In general, the larger the standard deviation, the higher the spatial activity; conversely, the smaller the
standard deviation, the lower the spatial activity. Therefore, by changing the space class tap pattern and the level
resolution capability of a tap for the space classification based on the threshold determination of the standard deviation σ,
the same effects as the foregoing embodiments can be obtained.

(4) Fourth Embodiment 



Further, as the fourth embodiment, a frequency distribution table which registers the values of the ADRC code "c",
representing the degree of variation of the level distribution pattern indicated by the ADRC code "c", may be produced,
and the class code c0 may be produced by a threshold determination using the generated frequency distribution table.

More specifically, in a frequency distribution table as shown in Fig. 13, the number of ADRC codes "c" existing
between threshold "0" and threshold "1" is counted, and the ratio of pixels existing in this region to pixels existing outside
this region is determined for the classification. In this case, if the ADRC codes "c" are concentrated, for example, in a
particular region, it may be determined that the spatial activity of the SD video signal is low. On the other hand, if the ADRC
codes "c" are widely spread, it may be determined that the spatial activity is high.



In other words, the SD video signal S1 inputted from the input terminal IN is supplied to the ADRC classification unit 36,
which executes the classification based on ADRC on the plurality of SD pixels centered on the remarked pixel.

The ADRC code "c" generated in the ADRC classification unit 36 is output to the post-processing unit 37 at the next
stage. The post-processing unit 37 generates a frequency distribution table in which the ADRC code "c" is registered, as
shown in Fig. 13, representing the degree of variation of the level distribution pattern based on the ADRC code "c". The
post-processing unit 37 then outputs data representing the frequency distribution table for the generated ADRC code "c"
to a threshold determination unit 38 at the next stage. The threshold determination unit 38 counts the number of ADRC
codes existing between threshold "0" and threshold "1" in the frequency distribution table, thereby executing a threshold
determination, and generates and outputs a class code c0.

Therefore, the space class tap pattern and the level resolution capability of a tap for the space classification are
changed based on the threshold determination using the frequency distribution table for this ADRC code "c",
thereby obtaining the same effects as the foregoing embodiments.
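
The frequency distribution test can be sketched as follows. For simplicity, and to avoid a division by zero, the sketch uses the fraction of codes lying inside the region rather than the inside-to-outside ratio mentioned in the description; the region bounds, the decision threshold, and the function names are all hypothetical.

```python
from collections import Counter


def fraction_inside(codes, lower, upper):
    """Fraction of codes whose value lies between the two thresholds (inclusive)."""
    table = Counter(codes)  # frequency distribution table: code value -> count
    inside = sum(count for value, count in table.items() if lower <= value <= upper)
    return inside / len(codes)


def activity_code_from_table(codes, lower, upper, min_fraction):
    """Assumed mapping: concentrated codes give 0 (low activity), spread codes give 1."""
    return 0 if fraction_inside(codes, lower, upper) >= min_fraction else 1


concentrated = [1, 1, 1, 2, 0, 3, 1, 1]
spread = [0, 3, 0, 3, 1, 2, 0, 3]
c0_low = activity_code_from_table(concentrated, lower=1, upper=2, min_fraction=0.7)
c0_high = activity_code_from_table(spread, lower=1, upper=2, min_fraction=0.7)
```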

(5) Fifth Embodiment 


Further, as the fifth embodiment, absolute values of differences between respective adjacent SD pixel values may be
registered in the frequency distribution table so as to produce a class code "c0" by evaluating the spatial activity using the
frequency distribution table. In this case, the spatial activity is high when a large number of pixels have large absolute
difference values, and conversely, the spatial activity is low when a large number of pixels have small absolute difference
values.

An adjacent pixels difference calculation unit for calculating absolute values of differences between respective adjacent
pixel values is provided in place of the ADRC classification unit 36 in the activity classification unit 35 shown in Fig. 12.

In short, an SD video signal S1 inputted from the input terminal IN is supplied to the adjacent pixels difference
calculation unit, which calculates the differences between adjacent pixels among a plurality of adjacent SD pixels centered
on the remarked pixel and generates the absolute values of the calculated differences. The absolute difference values
generated in the adjacent pixels difference calculation unit are supplied to the post-processing unit 37 at the next stage.
The post-processing unit 37 produces a frequency distribution table in which the absolute difference values are registered,
representing the degree of variation of the level distribution pattern. Then, the post-processing unit 37 outputs data
representing the frequency distribution table for the generated absolute difference values to a threshold determination
unit 38 at the next stage. The threshold determination unit 38 counts the number of pixels existing between threshold
"0" and threshold "1" in the frequency distribution table for the absolute difference values, and a class code c0 is thereby
produced and output by executing a threshold determination.

Accordingly, the space class tap pattern and the level resolution of a tap for the space classification are changed on
the basis of a threshold determination of the absolute difference values of the adjacent pixels, thereby producing the
same effects as the foregoing embodiments.
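
A short Python sketch of the adjacent difference evaluation follows. It assumes a one-dimensional run of pixels, treats differences up to a hypothetical bound as "small", and maps a high share of small differences to a low-activity code; none of these specifics come from the description.

```python
def adjacent_abs_differences(pixels):
    """Absolute differences between each pixel and its right-hand neighbour."""
    return [abs(b - a) for a, b in zip(pixels, pixels[1:])]


def activity_code_from_differences(pixels, small_bound, min_small_fraction):
    """Assumed mapping: mostly small differences give 0, otherwise 1."""
    diffs = adjacent_abs_differences(pixels)
    small = sum(1 for d in diffs if d <= small_bound)
    return 0 if small / len(diffs) >= min_small_fraction else 1


diffs = adjacent_abs_differences([10, 12, 11, 40, 42])
c0_smooth = activity_code_from_differences([10, 12, 11, 13, 12], 4, 0.75)
c0_busy = activity_code_from_differences([10, 60, 5, 70, 12], 4, 0.75)
```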

(6) Sixth Embodiment 

Fig. 14 shows an upconvertor 50 according to a sixth embodiment. The upconvertor 50 executes one-dimensional
Laplacian operations in plural directions in Laplacian filters 51A to 51E of a first classification unit 50A, and executes a
first-step classification by synthetically evaluating the resulting values. A class tap pattern of a second classification unit
50B is set according to the classification result of the first classification unit 50A. The second classification unit 50B then
executes a classification using the class tap pattern.

In the sixth embodiment, activities in the temporal direction and in the spatial direction are evaluated in the first-step
classification. The structure of the sixth embodiment will be described with reference to Fig. 14.

An SD video signal S1 supplied from the input terminal IN is supplied to the classification unit 50A. The SD
video signal S1 supplied to the classification unit 50A is supplied to the Laplacian filters 51A to 51E and a delay circuit 56B.
The five Laplacian filters 51A to 51E perform Laplacian operations, each in a different direction, on each
frame or each field of the inputted SD video signal S1, and output Laplacian values L0 to L4.

More specifically, the Laplacian operations are performed by the one-dimensional Laplacian filters 51A to 51E in
the horizontal direction (Fig. 15A), the vertical direction (Fig. 15B), a rightwardly declining oblique direction (the
direction of a diagonal extending from the upper left end to the lower right end on the plane of the drawing) (Fig. 15C), a
leftwardly declining oblique direction (the direction of a diagonal extending from the upper right end to the lower left end on
the plane of the drawing) (Fig. 15D), and a temporal direction (Fig. 15E), as shown in Figs. 15A to 15E.

Laplacian values L0 to L3 resulting from the Laplacian operations performed by the Laplacian filters 51A to 51D are
supplied to absolute value circuits 52A to 52D at the next stage, respectively. The absolute value circuits 52A to 52D
calculate absolute values of the respective Laplacian values L0 to L3 supplied thereto, and the resulting absolute values
a0 to a3 are supplied to a maximum value detector 53. The maximum value detector 53 detects a maximum value from
the absolute values a0 to a3 and compares the maximum value with a threshold value TH0. With this operation, the
flatness of the inputted SD video signal S1 is detected, and a value (flatness value) L10 indicating the flatness is output in
one bit. Further, the maximum value detector 53 outputs, in two bits, a value (maximum value detecting direction value)
a10 indicating the direction in which the maximum value is detected. Simultaneously, the Laplacian filter 51E outputs a
Laplacian value L4 in the temporal direction to an absolute value circuit 52E. The absolute value circuit
52E calculates the absolute value of the Laplacian value L4 and supplies the resulting absolute value a4 to a comparator 54.
The comparator 54 compares the absolute value a4 with a threshold value TH1 and outputs, in one bit, a value (temporal
direction changing value) a11 indicating a change in the temporal direction. Four-bit data combining the temporal
direction changing value a11, the above-mentioned maximum value detecting direction value a10 and the flatness value
L10 is supplied to the classification unit 50B at the next stage as a control signal CT.

Since the Laplacian value is basically a total of the spatial differences between a remarked pixel and the respective
pixels on both sides, the Laplacian value is larger as the adjacent pixels present larger changes. The first classification
unit 50A performs the one-dimensional Laplacian filtering to evaluate the activity in predetermined directions in the space,
in order to detect the direction in which an edge exists in the space, and roughly classifies its characteristic.
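
The first-step classification can be sketched in Python as below. The Laplacian is taken as the sum of the differences between the remarked pixel and its two neighbours along one direction, as stated above. The packing of the control signal CT (two direction bits, then one flatness bit, then one temporal bit), the polarity of the flatness bit, the use of the co-sited pixels of the preceding and following field for the temporal direction, and every name in the code are assumptions made for illustration.

```python
def laplacian_1d(before, center, after):
    """Sum of the differences between the center pixel and its two neighbours."""
    return (center - before) + (center - after)


def control_signal(prev_field, cur_field, next_field, row, col, th0, th1):
    """Return an assumed 4-bit CT: direction (2 bits), flatness (1 bit), temporal (1 bit)."""
    c = cur_field[row][col]
    spatial = [
        laplacian_1d(cur_field[row][col - 1], c, cur_field[row][col + 1]),          # horizontal
        laplacian_1d(cur_field[row - 1][col], c, cur_field[row + 1][col]),          # vertical
        laplacian_1d(cur_field[row - 1][col - 1], c, cur_field[row + 1][col + 1]),  # upper left to lower right
        laplacian_1d(cur_field[row - 1][col + 1], c, cur_field[row + 1][col - 1]),  # upper right to lower left
    ]
    magnitudes = [abs(v) for v in spatial]
    peak = max(magnitudes)
    direction = magnitudes.index(peak)        # first direction holding the maximum
    flatness_bit = 1 if peak > th0 else 0     # assumed polarity: 1 means not flat
    temporal = abs(laplacian_1d(prev_field[row][col], c, next_field[row][col]))
    temporal_bit = 1 if temporal > th1 else 0
    return (direction << 2) | (flatness_bit << 1) | temporal_bit


flat = [[10, 10, 10], [10, 10, 10], [10, 10, 10]]
vertical_line = [[10, 50, 10], [10, 50, 10], [10, 50, 10]]
horizontal_line = [[10, 10, 10], [50, 50, 50], [10, 10, 10]]

ct_flat = control_signal(flat, flat, flat, 1, 1, th0=40, th1=20)
ct_moving = control_signal(flat, vertical_line, flat, 1, 1, th0=40, th1=20)
ct_static = control_signal(horizontal_line, horizontal_line, horizontal_line, 1, 1, th0=40, th1=20)
```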

The control signal CT outputted from the classification unit 50A is supplied to selectors 55A to 55I and a delay
circuit 56A. The selectors 55A to 55I are connected to a register array 57 through lines on which the SD data selected by
the control signal CT is supplied. The register array 57 is supplied, through the delay circuit 56B, with the SD video signal
S1 delayed by several lines. The selectors 55A to 55I are selectively switched in response to the control signal
CT, select SD image data supplied from the register array 57 in accordance with the corresponding indexes, and
supply pixel data for nine pixels to a one-bit ADRC classification unit 58 at the next stage. In short, the class tap pattern
of the classification unit 50B at the next stage is selected in accordance with the control signal CT.

The ADRC classification unit 58 uses a class tap pattern formed of nine taps selected by the selectors 55A to 55I
to execute a one-bit ADRC classification and to output an ADRC code "c" represented in nine bits. As a result, the
classification unit 50B provides 512 (2^9) different classes.

Consequently, since the 16 classes provided by the first classification unit 50A are multiplied by the 512 classes provided
by the second classification unit 50B, the upconvertor 50 can classify a unit block of the SD video signal into 8192 classes.
By thus selecting an appropriate tap pattern for the ADRC classification at the next step in accordance with a class selected
at the first step based on the spatial activity, a highly accurate classification reflecting the spatial activity of the SD video
signal can be accomplished at the next and subsequent steps.
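
The class count can be checked with a small Python sketch. It assumes that one-bit ADRC assigns 1 to taps at or above the midpoint of the block's minimum and maximum, and that the four-bit control signal CT is placed above the nine-bit ADRC code when the two are combined; both the ordering and the function names are hypothetical.

```python
def one_bit_adrc(taps):
    """Pack one bit per tap: 1 if the tap is at or above the midpoint of MIN and MAX."""
    low, high = min(taps), max(taps)
    if high == low:
        return 0  # assumption: a flat block maps to the all-zero code
    midpoint = (low + high) / 2
    code = 0
    for tap in taps:
        code = (code << 1) | (1 if tap >= midpoint else 0)
    return code


def class_index(ct, adrc_code, tap_count=9):
    """Assumed layout: 4-bit CT in the upper bits, tap_count-bit ADRC code below."""
    return (ct << tap_count) | adrc_code


code = one_bit_adrc([0, 10, 0, 10, 0, 10, 0, 10, 0])
largest = class_index(0b1111, 0b111111111)
total_classes = 16 * 2 ** 9
```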

A prediction value calculation unit 50C at the stage subsequent to the classification unit 50B is composed of a
prediction coefficient RAM 59 for storing prediction coefficients for HD interpolated pixels, a prediction tap pattern setting
unit 60 for setting a prediction tap pattern for producing HD interpolated pixels, and a prediction calculation unit 61 for
producing HD interpolated pixels by executing calculations using the prediction tap pattern and a prediction coefficient.

A prediction coefficient is read from the prediction coefficient RAM 59 at a location indicated by address data formed
from two signals: the ADRC code "c" supplied from the one-bit ADRC classification unit 58 and the control signal
CT delayed through the delay circuit 56A. The read prediction coefficient is supplied to the prediction calculation unit
61. On the other hand, the SD video signal S1 sent from the register array 57 is output to the prediction tap pattern
setting unit 60. The prediction tap pattern setting unit 60 sets the prediction tap pattern used by the prediction calculation
unit 61 at the next stage. That is, from the inputted SD video signal S1, the prediction tap pattern on which the prediction
calculation unit 61 at the next stage executes the prediction calculation with the coefficient read from the prediction
coefficient RAM 59 is output. The prediction calculation unit 61 uses the pixel data of the prediction tap pattern and the
prediction coefficient to produce and output HD interpolated pixels by a linear first-order combination.

Here, the four-bit control signal CT generated by the classification unit 50A, and the class tap pattern selected by the
control signal CT and used at the classification unit 50B, will be described.

Fig. 16 shows a table listing indexes "I0 to I3" indicating the 16 (2^4) different combinations of the four-bit control
signal CT generated by the classification unit 50A. Figs. 17A to 17D, 18A to 18D, 19A to 19D, and 20A to 20D show
examples of tap pattern configurations of the classification unit 50B at the next stage, provided corresponding to the
indexes "I0 to I3". The relation between the control signal CT and the tap pattern is such that the class taps are arranged
in the direction of large change in the spatial direction (the direction of a large Laplacian value), and the spatial width of
the taps becomes smaller as the maximum Laplacian value becomes larger. Further, the tap pattern is composed of
taps in the same field if the Laplacian value in the temporal direction is large, and of taps in different fields
(for example, a frame) if the Laplacian value in the temporal direction is small.

Tap patterns shown in Figs. 17A to 17D are set corresponding to indexes "0000" to "0011", respectively, and are
selected when there is a large change in the horizontal direction.

Comparing the class tap patterns shown in Figs. 17A to 17D, it can be seen that the tap patterns for indexes "0010" and
"0011" shown in Figs. 17C and 17D have pixels spaced at narrower intervals in the horizontal direction than the tap
patterns for indexes "0000" and "0001" shown in Figs. 17A and 17B, thereby representing that the maximum Laplacian
value in the horizontal direction is large. That is, the spatial width of the class tap pattern becomes smaller as the
maximum Laplacian value in the horizontal direction becomes larger, and becomes larger as that value becomes smaller.
Also, as shown in Figs. 17B and 17D, when the bit (fourth bit) in the temporal direction of the indexes "0001" and "0011"
is on, the associated class tap pattern is positioned in the same field to represent a large change in the temporal direction.
On the contrary, as shown in Figs. 17A and 17C, when the bit (fourth bit) in the temporal direction of the indexes "0000"
and "0010" is off, taps are positioned on different fields to represent a small change in the temporal direction.

The class tap patterns for the indexes "0110" and "0111" shown in Figs. 18C and 18D have pixels spaced at
narrower intervals in the vertical direction than the class tap patterns for the indexes "0100" and "0101" shown in Figs. 18A
and 18B, thereby representing that the maximum Laplacian value in the vertical direction is large. That is, the spatial
width of the class tap pattern is smaller as the maximum Laplacian value in the vertical direction is larger, and
larger as that value is smaller. Also, as shown in Figs. 18B and 18D, when there are changes in the temporal direction
for the indexes "0101" and "0111", in other words when the bit (fourth bit) in the temporal direction is on, the class tap
pattern is positioned in the same field to represent a large change in the temporal direction. On the contrary, as shown in
Figs. 18A and 18C, when the bit (fourth bit) in the temporal direction of the indexes "0100" and "0110" is off, taps are
positioned on different fields to represent a small change in the temporal direction.

The class tap patterns for the indexes "1010" and "1011" shown in Figs. 19C and 19D have pixels spaced at
narrower intervals in the rightwardly declining oblique direction than the class tap patterns for the indexes "1000" and
"1001" shown in Figs. 19A and 19B, thereby representing that the maximum Laplacian value in the rightwardly declining
oblique direction is large. That is, the spatial width of the class tap pattern is smaller as the maximum Laplacian value in
the rightwardly declining oblique direction is larger, and larger as that value is smaller. Also, as shown in Figs.
19B and 19D, when the bits in the temporal direction of the indexes "1001" and "1011" are on, the associated class tap
pattern is positioned in the same field to represent a large change in the temporal direction. On the contrary, as shown in
Figs. 19A and 19C, when the bits in the temporal direction of the indexes "1000" and "1010" are off, taps are positioned on
different fields to represent a small change in the temporal direction.

The class tap patterns for the indexes "1110" and "1111" shown in Figs. 20C and 20D have pixels spaced at
narrower intervals in the leftwardly declining oblique direction than the class tap patterns for the indexes "1100" and "1101"
shown in Figs. 20A and 20B, thereby representing that the maximum Laplacian value in the leftwardly declining oblique
direction is large. That is, the spatial width of the class tap pattern is smaller as the maximum Laplacian value in the
leftwardly declining oblique direction is larger, and larger as that value is smaller. Also, as shown in Figs. 20B
and 20D, when the bits in the temporal direction of the indexes "1101" and "1111" are on, the associated class tap pattern
is positioned in the same field to represent a large change in the temporal direction. On the contrary, as shown in Figs.
20A and 20C, when the bits in the temporal direction of the indexes "1100" and "1110" are off, taps are positioned on
different fields to represent a small change in the temporal direction.

In this way, the class tap pattern is set so that the class taps lie in the direction of large change in the spatial direction
(the direction of the large Laplacian value), and so that the spatial width of the tap pattern is smaller as the maximum
Laplacian value is larger. Further, the class tap pattern is set so that it is composed of taps in the same field
if the Laplacian value in the temporal direction is large, and of taps in different fields (for
example, a frame) if the Laplacian value in the temporal direction is small.
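
The reading of the four-bit index given above can be summarised in a short decoding sketch. The bit positions follow the index values quoted for Figs. 17 to 20 (upper two bits for the direction, third bit for the narrower spacing, fourth bit for the temporal direction); the function name, the dictionary keys, and the direction labels are illustrative only.

```python
DIRECTIONS = (
    "horizontal",               # indexes 0000 to 0011
    "vertical",                 # indexes 0100 to 0111
    "right-declining oblique",  # indexes 1000 to 1011
    "left-declining oblique",   # indexes 1100 to 1111
)


def decode_index(index):
    """Split a 4-bit index into the tap pattern properties described in the text."""
    if not 0 <= index <= 0b1111:
        raise ValueError("index must fit in four bits")
    return {
        "direction": DIRECTIONS[index >> 2],
        "narrow_taps": bool((index >> 1) & 1),  # set when the maximum Laplacian value is large
        "same_field": bool(index & 1),          # set when the temporal change is large
    }


pattern_17d = decode_index(0b0011)
pattern_20a = decode_index(0b1100)
```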

Next, the operation of the foregoing upconvertor 50 of the sixth embodiment and of each unit of the upconvertor 50
will be described. When an SD video signal S1 is inputted to the first classification unit 50A, the SD video signal S1 is
subjected to Laplacian filtering in a plurality of different directions in the space by the Laplacian filters 51A to 51D.
Absolute values a0 to a3 of the resulting Laplacian values L0 to L3 are calculated by the absolute value circuits 52A to
52D and outputted therefrom.

Then, the absolute values a0 to a3 are supplied to the maximum value detector 53, which detects the maximum of
the absolute values of the supplied Laplacian values. The direction of an edge in the space of the inputted video
signal is detected from the direction of the Laplacian value that gave the maximum value, and the value a10
indicating that direction (the maximum-value direction value) is represented in two bits. Further, the maximum
value is compared with a threshold to evaluate the flatness of the inputted SD video signal Si, and the value
representing the flatness (flatness value) L10 is represented in one bit. Also, a Laplacian value L4 in the
temporal direction is determined by a Laplacian filter 51E. The absolute value a4 of the Laplacian value L4 is
calculated by an absolute value circuit 52E and outputted therefrom. The absolute value a4 is compared with a
threshold TH1 by a comparing circuit, and the result is output as a one-bit value representing a change in the
temporal direction (temporal-direction change value) a11. In this way, the characteristics of the inputted SD
video signal Si can be roughly revealed. The classification unit 50A supplies a four-bit control signal CT,
composed of the two-bit maximum-value direction value a10, the one-bit flatness value L10, and the one-bit
temporal-direction change value a11, to the second classification unit 50B.
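
The first-stage classification described above can be illustrated with a minimal Python sketch. It is not the
circuit of Fig. 14: the second-difference Laplacian kernel, the bit order inside CT, the polarity of the flatness
bit, and the names laplacian_1d and control_signal are all assumptions made for illustration.

```python
def laplacian_1d(before, center, after):
    """Second difference of three samples along one direction (assumed kernel)."""
    return before - 2 * center + after


def control_signal(spatial_triples, temporal_triple, flat_th, temporal_th):
    """Pack direction (2 bits), flatness (1 bit) and temporal change (1 bit) into CT.

    spatial_triples: four (before, center, after) sample triples, one per spatial
    direction, standing in for the inputs of filters 51A to 51D.
    temporal_triple: the same along the temporal direction (filter 51E).
    """
    abs_vals = [abs(laplacian_1d(*t)) for t in spatial_triples]      # a0..a3
    max_val = max(abs_vals)
    direction = abs_vals.index(max_val)                              # a10, 0..3
    flatness = 1 if max_val > flat_th else 0                         # L10 (assumed polarity)
    temporal = 1 if abs(laplacian_1d(*temporal_triple)) > temporal_th else 0  # a11
    # Assumed bit layout: direction in the two upper bits, then flatness, then temporal.
    return (direction << 2) | (flatness << 1) | temporal


# An edge along the third direction, no temporal change.
ct = control_signal([(10, 10, 10), (10, 10, 10), (0, 10, 40), (10, 10, 10)],
                    (10, 10, 10), flat_th=8, temporal_th=8)
```

The resulting four-bit value plays the role of the control signal CT that switches the selectors in the second
classification unit.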

The classification unit 50B at the next stage sets a spatial class tap pattern on the basis of an index "10 to 13"
corresponding to the four-bit control signal CT. More specifically, the classification unit 50B switches the
selectors 55A to 55I in response to the control signal CT, selects a class tap pattern determined for the input SD
video signal supplied through the register array 57, and supplies the selected tap pattern to the ADRC
classification unit 58. In this way, a highly accurate classification can be accomplished based on a class tap
pattern reflecting the spatial activity of the SD video signal.

Also, the SD video signal Si is output from the register array 57 to a prediction calculation unit 50C at the next
stage. In a prediction tap pattern unit 60, a prediction tap pattern for executing the prediction calculation in
the prediction calculation unit 61 is set, and the set prediction tap pattern is output to the prediction
calculation unit 61. In the prediction calculation unit 61, a prediction calculation is executed by a linear
first-order combination of the prediction coefficients, read from the prediction coefficient RAM 59 in accordance
with the control signal CT and the ADRC code "c", and the prediction tap pattern supplied from the prediction tap
pattern unit 60, and the HD interpolated pixels are thus output.
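
The linear first-order combination performed in the prediction calculation unit can be sketched as follows. The
coefficient values, the dictionary standing in for the prediction coefficient RAM, and the function name
predict_hd_pixel are hypothetical; real coefficients would be obtained by learning.

```python
def predict_hd_pixel(coefficients, taps):
    """Linear first-order combination: sum of coefficient * SD tap pixel."""
    if len(coefficients) != len(taps):
        raise ValueError("one coefficient is needed per prediction tap")
    return sum(w * x for w, x in zip(coefficients, taps))


# Hypothetical stand-in for the coefficient RAM, addressed by (CT, ADRC code "c").
coefficient_ram = {(10, 0b101): [0.25, 0.5, 0.25]}

taps = [100, 120, 140]                      # SD pixels of the prediction tap pattern
hd_pixel = predict_hd_pixel(coefficient_ram[(10, 0b101)], taps)
```

Keying the table on both the control signal and the ADRC code mirrors the way the coefficients are read out in the
text; a three-tap pattern is used here only to keep the example short.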

According to the configuration described above, the Laplacian filters 51A to 51E in the first classification unit
50A execute one-dimensional Laplacian operations in plural directions, and the first classification is performed by
evaluating the resulting values together. Next, a class tap pattern of the second classification unit 50B is set
in accordance with the result of the first classification unit 50A. Then the second classification unit 50B
performs a classification using the class tap pattern. Therefore, a highly accurate classification of the SD video
signal Si can be accomplished reflecting the spatial activity of the SD video signal Si, thus producing effects
similar to those of the foregoing embodiments.

(7) Seventh Embodiment

Fig. 21, where parts corresponding to those in Fig. 14 are designated by the same reference numerals, shows an
upconvertor 65 according to a seventh embodiment. As in the sixth embodiment, in the seventh embodiment a class tap
pattern for the classification in the second classification unit 50B is set in accordance with the first
classification using the Laplacian filters 51A to 51E of the first classification unit 50A, and a second-step
classification is executed on the basis of the class tap pattern, thus realizing a two-step classification.

The configuration of the upconvertor 65 in the seventh embodiment will be described using Fig. 21. An SD video
signal Si inputted from an input terminal IN is supplied to a classification unit 65A, where it is supplied to each
of the Laplacian filters 51A to 51E and to a delay unit 56A. The five Laplacian filters 51A to 51E execute
Laplacian operations on each block of each frame or field of the supplied SD video signal Si, each filter in a
different direction, and output Laplacian values L0 to L4. The Laplacian filters 51A to 51E are the same as those
used in the upconvertor 50 of the sixth embodiment shown in Fig. 14. The Laplacian values L0 to L4 are respectively
supplied to quantizers 66A to 66E at the next stage.

The quantizers 66A to 66E calculate the absolute values of the corresponding Laplacian values L0 to L4 and execute
non-linear quantization to output quantized values q0 to q4. For example, if the quantizers 66A to 66E quantize
their inputs into quantized values q0 to q4 each taking one of two values, "0" or "1", the inputted SD video signal
Si can be classified into a total of 32 (2^5) different classes. That is, each quantizer assigns the quantized
value "0" when the absolute value is small and the quantized value "1" when the absolute value is large. The
quantized values q0 to q4 are then supplied to a combiner 67, which combines them to generate a five-bit class code
and outputs the five-bit class code to a second classification unit 65B as a control signal CT representing the
first-step classification.
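
The two-level quantization and the combiner can be pictured with the short sketch below. A single shared threshold
and the choice of q0 as the most significant bit are assumptions; the text only states that small absolute values
map to "0" and large ones to "1".

```python
def first_step_class_code(laplacians, threshold):
    """Quantize |L0|..|L4| to one bit each and combine them into a five-bit code."""
    code = 0
    for value in laplacians:                       # L0..L4
        bit = 1 if abs(value) > threshold else 0   # q0..q4
        code = (code << 1) | bit                   # q0 ends up as the top bit (assumed)
    return code


ct = first_step_class_code([3, -20, 0, 15, -1], threshold=8)   # bits 0,1,0,1,0
number_of_classes = 2 ** 5
```

With five one-bit quantized values the code ranges over 32 values, matching the class count given in the text.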

The classification unit 65B switches selectors 55A to 55I on the basis of the control signal CT, similarly to the
sixth embodiment, and selects a tap pattern composed of nine pixels from the SD pixels supplied through a register
57. The nine-pixel tap pattern selected by the selectors 55A to 55I is supplied to a one-bit ADRC classification
unit 58, which executes the second-step classification, producing 512 (2^9) different class codes by executing an
ADRC operation on the inputted class tap pattern. In this way, 16384 different classes are obtained by combining
the class of the first-step classification unit 65A with the class of the second-step classification unit 65B. The
ADRC classification unit 58 generates a nine-bit class code, and the nine-bit class code and the first-step class
code supplied through the delay circuit 56B are supplied to the prediction value calculation unit 65C. The
explanation of the calculation unit 65C is omitted because it is similar to that of the foregoing sixth embodiment.
A figure of the class tap patterns used in the classification unit 65B according to the control signal CT supplied
from the first classification unit 65A is also omitted. However, the class tap pattern is set so that, as a
Laplacian value becomes larger, the spatial width of the tap pattern becomes smaller and the taps extend further in
the direction in which the Laplacian value is large. Also, the tap pattern is composed of taps in the same field
when the Laplacian value in the temporal direction is large, and of taps in a different field (for example, a
frame) when the Laplacian value in the temporal direction is small. With the foregoing configuration, it is
possible to produce effects similar to those of the foregoing embodiments.
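
As a rough illustration of the second-step classification and of the class count, the sketch below applies a
one-bit ADRC to nine tap pixels and concatenates the result with the five-bit first-step code. The requantization
rule (a pixel maps to 1 when it lies in the upper half of the dynamic range) is one common form of one-bit ADRC and
is assumed here, as are the function names.

```python
def one_bit_adrc(taps):
    """Requantize each tap to one bit relative to the block's dynamic range."""
    lo, hi = min(taps), max(taps)
    dr = hi - lo                                   # dynamic range DR
    code = 0
    for p in taps:
        bit = 1 if dr > 0 and (p - lo) * 2 >= dr else 0
        code = (code << 1) | bit
    return code


def combined_class(first_step_code, adrc_code, adrc_bits=9):
    """Concatenate the five-bit first-step code with the nine-bit ADRC code."""
    return (first_step_code << adrc_bits) | adrc_code


adrc_code = one_bit_adrc([10, 20, 30, 40, 50, 60, 70, 80, 90])
largest_class = combined_class(31, 511)            # highest possible combined code
```

Since the combined code runs from 0 to 16383, the sketch reproduces the 32 x 512 = 16384 classes stated above.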

(8) Eighth Embodiment

Fig. 22, where parts corresponding to those in Figs. 4 and 5 are designated by the same reference numerals, shows a
classification unit 70 of an upconvertor according to an eighth embodiment. An SD video signal Si inputted from an
input terminal IN is supplied in parallel to an ADRC classification unit 71 and a plurality of class tap pattern
selectors 72 (72A to 72G). The ADRC classification unit 71 classifies the inputted SD video signal Si in accordance
with a pixel level distribution pattern by an ADRC operation using four pixels around a remarked pixel extracted
from the inputted SD video signal Si, and supplies a selector 73 with the resulting ADRC code "c". The selector 73
is also supplied with tap patterns classified as space classes by the class tap pattern selectors 72A to 72G, each
of which sets a spatial class tap pattern determined in accordance with the characteristics of the inputted SD
video signal Si.

More specifically, for a level distribution (73A, 73B, 74A, 74B, 75A, 75B, 76A, 76B, 77A, 77B, 78A, 78B, 79A, 79B,
80A, 80B) indicated by each ADRC code "c" resulting from the one-bit ADRC operation performed on the four pixels
around the remarked pixel of the inputted SD video signal Si, each of the selectors 72A to 72G sets a class tap
pattern (81, 82, 83, 84, 85, 86, 87) representing the signal characteristics corresponding to the associated level
distribution, as shown in Figs. 23A, 23B, 23C, 23D, 24A, 24B, and 24C.

For example, level distributions 73A and 73B indicated by ADRC codes "c" exhibit level distribution characteristics
in a rightwardly declining direction (the direction of a diagonal from the upper left end to the lower right end on
the plane of the drawing). Since an edge of the image is thought to exist in that direction, a class tap pattern 81
in the rightwardly declining oblique direction is associated with the level distributions 73A and 73B. Similarly,
since level distributions 74A and 74B indicated by ADRC codes "c" exhibit level distribution characteristics in a
leftwardly declining direction (the direction of a diagonal from the upper right end to the lower left end on the
plane of the drawing), a class tap pattern 82 in the leftwardly declining oblique direction is associated with the
level distributions 74A and 74B. In addition, since level distributions 75A and 75B, and 76A and 76B, indicated by
ADRC codes "c" exhibit level distribution characteristics offset to the left side and to the right side,
respectively, class tap patterns 83 and 84 having prediction taps offset to the left side and to the right side are
associated with the level distributions 75A and 75B, and 76A and 76B, respectively.

Further, since level distributions 77A and 77B, and 78A and 78B, indicated by ADRC codes "c" exhibit level
distribution characteristics offset to the upper side and to the lower side, respectively, class tap patterns 85
and 86 having class taps offset to the upper side and to the lower side are associated with the level distributions
77A and 77B, and 78A and 78B, respectively. Since level distributions 79A and 79B, and 80A and 80B, indicated by
ADRC codes "c" exhibit regular level distribution characteristics, a class tap pattern 87 using all class taps is
associated with the level distributions 79A and 79B, and 80A and 80B.

In this way, a class tap pattern p10 is selected in the selector 73 in accordance with the ADRC code "c" from the
class tap patterns set in the tap pattern selectors 72A to 72G, and supplied to an ADRC classification unit 26 at
the next stage. The ADRC classification unit 26 uses the selected tap pattern p10 to perform a one-bit ADRC
operation on the SD video signal Si and supplies a prediction coefficient ROM 14 with the resulting ADRC code d0 as
address data for reading the corresponding prediction coefficients.

Next, the operation of the classification unit 70 in the upconvertor of the eighth embodiment will be described.

The inputted video signal Si is supplied to the first ADRC classification unit 71, which executes a one-bit ADRC
operation using four pixels around the remarked pixel and produces a class code "c". Also, the inputted SD video
signal Si is formed into a plurality of class tap patterns 81 to 87 reflecting the signal characteristics by the
class tap pattern selectors 72A to 72G. Then a class tap pattern p10 is selected from the plurality of class tap
patterns 81 to 87 in accordance with the class code "c" supplied from the first ADRC classification unit 71, and
output to an ADRC classification unit 26 at the next stage. The ADRC classification unit 26 executes a one-bit ADRC
operation on the inputted SD video signal Si formed into the selected class tap pattern p10 to generate a class
code d0, and supplies the class code d0 to the prediction coefficient ROM 14 as address data.

According to the configuration described above, in producing address data for selecting prediction coefficients, a
class tap pattern is selected in accordance with the spatial activity of the inputted SD video signal Si detected
by the ADRC classification unit 71 at the first stage. The second-step ADRC classification is then executed by the
ADRC classification unit 26 using the selected class tap pattern p10, so that a class code d0 reflecting the
spatial activity of the inputted SD video signal Si can be generated, thus producing effects similar to those of
the foregoing embodiments.

In addition, according to the embodiment described above, a prediction tap pattern used for a prediction calculation
is changed in accordance with the spatial activity of the inputted SD video signal Si, so that the calculation
processing can be reduced when there are many prediction taps.

Further, as in the first embodiment, the number of quantization bits in the second-stage ADRC classification unit 26
can be switched on the basis of the signal outputted from the first-stage ADRC classification unit 71.

(9) Ninth Embodiment

Fig. 25 shows an upconvertor 90 according to a ninth embodiment. The upconvertor 90 first executes a rough
classification of the intra-space activity by a one-bit ADRC operation in a first-stage classification unit, and
then executes a multi-bit ADRC operation for a detailed classification in a second-stage classification unit.

In the upconvertor 90, an SD video signal Si inputted through an input terminal IN is supplied to a first-stage
block 91 and to delay circuits 97 and 101. The first-stage block 91 extracts a block of n x m pixels (for example,
5 x 5 pixels in Fig. 26) centered on a remarked pixel (marked in Fig. 26) in the current frame or field of the SD
video signal Si, as shown in Fig. 26, and supplies a one-bit ADRC classification unit 92 with the resulting block
data b1.

The one-bit ADRC classification unit 92 executes a one-bit ADRC operation on the block data b1 consisting of 5 x 5
pixels and supplies the resulting ADRC code c10 to a ROM 94 after passing it through a delay circuit 93A for timing
adjustment. The one-bit ADRC classification unit 92 further supplies a comparator circuit 95 with the dynamic range
DR calculated when the ADRC code c10 was derived. At the same time, the one-bit ADRC classification unit 92
supplies a multi-bit ADRC classification unit 96 with the dynamic range DR and a minimum pixel level MIN.

The comparator circuit 95 compares the dynamic range DR with a threshold TH and supplies the comparison result CR
to the ROM 94 through a delay circuit 93B. Based on the comparison result CR, the multi-bit ADRC classification is
not executed if the dynamic range DR is smaller than the threshold TH (CR equals "0"), and is executed if the
dynamic range DR is larger than the threshold TH (CR equals "1"). In this embodiment, however, the multi-bit ADRC
classification unit 96 normally operates regardless of the comparison result. Therefore, when CR equals "0", the
ROM 94 at the subsequent stage ignores the ADRC code c11 from the multi-bit ADRC classification unit 96, which is
regarded as equivalent to not executing the multi-bit ADRC classification. Alternatively, the comparison result CR
may be supplied to the multi-bit ADRC classification unit 96, as shown by a broken line in Fig. 25, so that the
unit itself is controlled not to execute the multi-bit ADRC classification.

The SD video signal Si, delayed through the delay circuit 97 for timing adjustment, is supplied to a second-stage
block 98, which defines, for example, block data b2 of nine pixels consisting of 3 x 3 pixels including the
remarked pixel, as shown in Fig. 26, and supplies it to the multi-bit ADRC classification unit 96. The multi-bit
ADRC classification unit 96 uses the dynamic range DR and the minimum pixel level MIN calculated in the one-bit
ADRC classification unit 92 to classify the block data b2, and supplies the resulting ADRC code c11 to the ROM 94.

As shown in Fig. 27, a table of class codes d0 having a hierarchical structure in accordance with the
classification by the ADRC code c10 derived by the one-bit ADRC classification unit 92 and the ADRC code c11
derived by the multi-bit ADRC classification unit 96 has been produced in advance by learning and stored in the
ROM 94. The upconvertor 90 reads a class code d0 from the ROM 94 in accordance with the ADRC codes c10 and c11
derived by the one-bit ADRC classification unit 92 and the multi-bit ADRC classification unit 96, based on the
comparison result CR, and supplies the read class code d0 to a prediction coefficient RAM 99 at the next stage.
Likewise, prediction coefficients derived by learning are stored in the prediction coefficient RAM 99 in
accordance with the class codes d0 arranged in a hierarchical structure.

A set of prediction coefficients is sequentially read from the prediction coefficient RAM 99 with the class code d0
used as address data, and supplied to a prediction calculation unit 100, which calculates HD interpolated pixels by
executing a product-sum calculation with the SD video signal Si. The prediction tap setting unit 102 is supplied
with the SD video signal Si delayed through a delay circuit 101, and outputs the prediction tap pattern shown in
Fig. 28 to the prediction calculation unit 100, where it is used to produce the interpolated pixels. In this way,
the prediction calculation unit 100 executes a linear prediction calculation with the group of prediction
coefficients corresponding to the SD video signal Si to produce interpolated pixels.

Next, the operation of the upconvertor 90 according to the ninth embodiment and of each of its units will be
described.

An SD video signal Si inputted to the first-stage block 91 of the upconvertor 90 is extracted in units of 5 x 5
pixel blocks each centered on a remarked pixel, and each pixel block is classified in the one-bit ADRC
classification unit 92. Then, the dynamic range DR of each pixel block outputted from the one-bit ADRC
classification unit 92 is compared with a predetermined threshold TH in the comparator circuit 95. If the dynamic
range DR is larger than the threshold TH, the multi-bit ADRC classification by the unit 96 is executed, and a class
code d0 is read from the ROM 94 based on the results of the one-bit ADRC classification unit 92 and the multi-bit
ADRC classification unit 96. Conversely, if the dynamic range DR is smaller than the threshold TH, a class code is
read from the ROM 94 based only on the result of the one-bit ADRC classification unit 92, without executing the
multi-bit ADRC classification.
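
The threshold-controlled two-stage flow can be sketched as below. The ADRC requantization formulas, the two-bit
depth of the second stage, and the function names are assumptions for illustration; the sketch returns the pair of
codes that would address the table in the ROM 94 rather than a learned class code d0.

```python
def one_bit_adrc(block):
    """First stage: one-bit ADRC code plus the dynamic range DR and minimum MIN."""
    lo, hi = min(block), max(block)
    dr = hi - lo
    code = 0
    for p in block:
        bit = 1 if dr > 0 and (p - lo) * 2 >= dr else 0
        code = (code << 1) | bit
    return code, dr, lo


def multi_bit_adrc(block, dr, lo, bits=2):
    """Second stage: requantize with DR and MIN inherited from the first stage."""
    levels = 1 << bits
    code = 0
    for p in block:
        q = (p - lo) * levels // (dr + 1)          # assumed requantization rule
        q = max(0, min(levels - 1, q))
        code = (code << bits) | q
    return code


def rom_address(block_5x5, block_3x3, threshold, bits=2):
    """Return (c10, c11); c11 is None when the block is too flat for stage two."""
    c10, dr, lo = one_bit_adrc(block_5x5)
    if dr > threshold:                             # CR equals "1"
        return c10, multi_bit_adrc(block_3x3, dr, lo, bits)
    return c10, None                               # CR equals "0"


flat = rom_address([100] * 24 + [102], [100] * 9, threshold=10)
active = rom_address([0] * 24 + [255], [0, 64, 128, 255, 0, 0, 0, 0, 0], threshold=10)
```

Skipping the second stage for low-activity blocks is what allows the overall classification processing to be
reduced when the first step alone suffices.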

In this way, the class code d0 is read from the ROM 94 in accordance with the results of the one-bit ADRC
classification unit 92 and/or the multi-bit ADRC classification unit 96, and supplied to the prediction coefficient
RAM 99 as address data. The intra-space activity is roughly evaluated in the first-step classification and a
detailed classification is executed in accordance with the intra-space activity at the second step, so that the
inputted SD video signal Si can be appropriately classified reflecting its intra-space activity.

A set of prediction coefficients is read from the prediction coefficient RAM 99 in accordance with the class code
d0 and supplied to the prediction calculation unit 100. The prediction calculation unit 100 performs a prediction
calculation using the prediction coefficients on the prediction tap pattern set in the prediction tap setting unit
102 to produce and output HD interpolated pixels.

According to the configuration described above, a rough classification is executed at the first step and a detailed
classification is executed at the second step, so that the input SD video signal Si can be classified more
appropriately and in detail at the second step in accordance with the signal characteristics revealed by the
first-step classification. This produces effects similar to those of the foregoing embodiments. In addition, when
the classification is completed at the first step alone, the overall classification processing can be reduced.

(10) Other Embodiments 

The foregoing embodiments have dealt with the case where an ADRC classification technique is used as a
classification method based on compressed data of the input video signal. However, the present invention is not
limited to this, and the classification can be executed by compressing data using other techniques including, for
example, those employing DPCM (Differential Pulse Code Modulation), VQ (Vector Quantization), MSB (Most Significant
Bit), binarization, discrete cosine transform (DCT), and so on.

Also, the foregoing embodiments have dealt with the case where the classification unit 12 executes the
classification in two or three steps. However, the present invention is not limited to these particular numbers of
steps, and the classification can be executed in a larger number of steps, provided that the adaptability is
successively increased at subsequent steps after a rough classification at the first step. With a larger number of
steps, a more accurate classification can be accomplished.

Further, the foregoing embodiments have dealt with the case where a two-dimensional non-separable filter is used as
an upconvertor. However, the present invention is not limited to this type of upconvertor, and can be applied to an
upconvertor 110 comprising a vertical/horizontal separable filter configuration as illustrated in Fig. 29, where
parts corresponding to those in Fig. 4 are designated by the same reference numerals.

In the upconvertor 110, an SD video signal Si inputted through an input terminal IN is first supplied to a
classification unit 12 and a prediction calculation unit 111. The prediction calculation unit 111 is divided into
two sets: a vertical prediction calculation unit 111A and a horizontal prediction calculation unit 111B
corresponding to positions mode 1 and mode 2 on a scanning line, respectively, and a vertical prediction
calculation unit 111C and a horizontal prediction calculation unit 111D corresponding to positions mode 3 and mode
4 on a scanning line, respectively. The classification unit 12, adopting the classification of the foregoing
embodiments, generates a class code d0 in accordance with the inputted SD video signal Si, and the class code d0 is
supplied to a prediction coefficient ROM 112 serving as a storage means which stores tap prediction coefficients in
advance. The prediction coefficient ROM 112 is divided into a vertical coefficient ROM 112A for storing vertical
components of the tap prediction coefficients and a horizontal coefficient ROM 112B for storing horizontal
components of the tap prediction coefficients. The class code d0 is supplied to each of the vertical coefficient
ROM 112A and the horizontal coefficient ROM 112B.

First, a vertical prediction coefficient d6 outputted from the vertical coefficient ROM 112A is supplied to the
vertical prediction calculation units 111A and 111C. The vertical prediction calculation units 111A and 111C
generate vertical estimated values d7 and d8 by a product-sum calculation of the inputted SD video signal Si and
the vertical prediction coefficient d6. The vertical estimated values d7 and d8 are supplied to the horizontal
prediction calculation units 111B and 111D at the next stage, respectively.

A horizontal prediction coefficient d9 outputted from the horizontal coefficient ROM 112B is supplied to the
horizontal prediction calculation units 111B and 111D. The horizontal prediction calculation units 111B and 111D
produce HD pixel signals d10 and d11 by performing a product-sum calculation of the horizontal prediction
coefficient d9 and the vertical estimated values d7 and d8. The HD pixel signals d10 and d11 are selectively
supplied to a selector 15, where they are appropriately rearranged, whereby an HD signal S2, which is the final
output, is outputted from an output terminal OUT.
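
The vertical-then-horizontal product-sum calculation can be illustrated with the following sketch of a single
prediction chain. The 3 x 3 block size, the coefficient values, and the function names are hypothetical, and the
sketch does not model the separate units for the four mode positions.

```python
def dot(coefficients, samples):
    """Product-sum calculation of coefficients and samples."""
    return sum(w * x for w, x in zip(coefficients, samples))


def separable_predict(block, vertical_coeffs, horizontal_coeffs):
    """Vertical stage per column, then a horizontal stage over the column estimates."""
    columns = list(zip(*block))                                    # transpose rows to columns
    vertical_estimates = [dot(vertical_coeffs, col) for col in columns]
    return dot(horizontal_coeffs, vertical_estimates)


sd_block = [[1, 2, 3],
            [4, 5, 6],
            [7, 8, 9]]
hd_pixel = separable_predict(sd_block, [0, 1, 0], [0.5, 0.5, 0])
```

For an n x n tap block a separable arrangement needs on the order of 2n coefficients per class rather than n x n,
which is the usual motivation for this kind of configuration.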

Also, the foregoing embodiments have dealt with the case where prediction coefficients, each representing a
correlation between a remarked SD pixel and transmitted pixels around the remarked pixel, are used to produce HD
pixels around the remarked pixel from SD pixels. However, the present invention is not limited to this form of
producing HD pixels; predicted values for HD interpolated pixels can be set in advance for each class instead of
the prediction coefficients, and stored in a storage means. Conversion of an SD video signal into an HD video
signal using predicted values may be performed by an upconvertor 115 as shown in Fig. 30, where parts corresponding
to those in Fig. 4 are designated by the same reference numerals.

In the upconvertor 115, an SD video signal Si is supplied to a classification unit 12 through an input terminal IN.
The classification unit 12 generates a class code d0 based on the characteristics of SD pixels around the HD
interpolated pixels, which are the interpolated pixels to be newly produced, and supplies the class code d0 to
predicted value ROMs 116A to 116D. The predicted value ROMs 116A to 116D store predicted values constituting
prediction data for HD interpolated pixels, calculated in advance by learning, in correspondence with the class
code d0 for each class. Predicted values d20 to d23 for HD interpolated pixels are read from the predicted value
ROMs 116A to 116D with the class code d0 used as address data, and outputted through a selector 15 to an output
terminal OUT. In this way, it is possible to produce a high resolution video signal in which the predicted values
d20 to d23 are inserted as HD interpolated pixels among the pixels constituting the inputted video signal.

A first method for calculating the predicted values may be a learning method using a weighted average technique.
The weighted average technique classifies remarked pixels using the SD pixels around them, adds up the pixel values
of the remarked pixels (i.e., HD pixels) for each class, and divides each sum by a frequency count incremented once
for each remarked pixel in that class. These operations are performed on a variety of images to derive predicted
values.
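
A minimal sketch of this per-class averaging is given below. The input format (pairs of class code and HD pixel
value, with classification assumed to have been done beforehand) and the name learn_predicted_values are
assumptions for illustration.

```python
from collections import defaultdict


def learn_predicted_values(samples):
    """samples: iterable of (class_code, hd_pixel_value) gathered from training images."""
    totals = defaultdict(float)   # accumulated HD pixel values per class
    counts = defaultdict(int)     # frequency per class
    for class_code, hd_value in samples:
        totals[class_code] += hd_value
        counts[class_code] += 1
    return {c: totals[c] / counts[c] for c in totals}


predicted = learn_predicted_values([(3, 100), (3, 110), (7, 50)])
```

The resulting dictionary plays the role of the predicted value ROM contents: one stored value per class code.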

A second method for calculating predicted values may be a learning method using normalization. This learning method
first forms a block comprising a plurality of pixels including a remarked pixel, and uses the dynamic range in the
block to normalize a value calculated by subtracting a reference value of the block from the pixel value of the
remarked pixel. Next, the accumulated value of the normalized values is divided by the accumulated frequency to
derive a predicted value.
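
The normalization method can be sketched in the same style. The text does not say which reference value is used, so
the block minimum is assumed here; skipping blocks with zero dynamic range and the name learn_normalized_values are
likewise assumptions.

```python
from collections import defaultdict


def learn_normalized_values(samples):
    """samples: iterable of (class_code, hd_pixel_value, block_pixels)."""
    accumulated = defaultdict(float)   # accumulated normalized values per class
    frequency = defaultdict(int)       # accumulated frequency per class
    for class_code, hd_value, block in samples:
        reference = min(block)                     # assumed reference value
        dynamic_range = max(block) - reference
        if dynamic_range == 0:
            continue                               # flat block: nothing to normalize by
        accumulated[class_code] += (hd_value - reference) / dynamic_range
        frequency[class_code] += 1
    return {c: accumulated[c] / frequency[c] for c in accumulated}


normalized = learn_normalized_values([(1, 15, [10, 20]), (1, 35, [30, 40])])
```

Under these assumptions a pixel value would be recovered at conversion time as the block reference plus the stored
normalized value multiplied by the block's dynamic range.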

Further, in the foregoing embodiments, an intra-space activity is evaluated for an inputted SD video signal, and a
tap pattern for executing classifications is selected on the basis of the result of the evaluation. However, the
present invention is not limited thereto; an intra-space activity may be evaluated for an inputted SD video signal,
and the prediction tap pattern that is linearly combined with the prediction coefficients in the prediction
calculation unit may be selected on the basis of the result of the evaluation. In this case, for example, the
control signal CT supplied from the classification unit 50A shown in Fig. 14 is supplied to the prediction tap
pattern unit 60 in the calculation unit 50C.

Further, in the foregoing embodiments, a prediction coefficient is read from a prediction coefficient memory on the
basis of both the class code supplied from the first-step classification unit and the class code supplied from the
second-step classification unit. However, the present invention is not limited thereto, and a prediction
coefficient can be read from the prediction coefficient memory using either the class code supplied from the
first-step classification unit or the class code supplied from the second-step classification unit.

Furthermore, in the foregoing embodiments, an SD video signal is converted into an HD video signal. However, the
present invention is not limited thereto and can be applied to the generation of interpolated pixels for enlarging
or otherwise changing an image. The present invention can also be applied to a signal converting apparatus for
converting signals having a small number of pixels into signals having a large number of pixels, such as a
converter for converting NTSC signals into PAL (phase alternation by line) signals. It can also be applied to a YC
separating apparatus for generating signals of higher accuracy than conventional signals.

According to embodiments of the present invention as described above, an intra-space activity is evaluated and
classified for each block of an inputted video signal, and each block of the input video signal is classified in an
appropriate number of steps in accordance with an activity code generated as a result of the classification. Thus,
the activity code generated at the first step and the results of classifications at previous steps are reflected in
the classifications at the next and subsequent steps, so that a highly accurate classification can be accomplished
for the input video signal, and highly accurate interpolated pixels can be predicted and produced for a low
resolution input image using prediction coefficients or predicted values based on the results of the
classifications. This makes it possible to realize a signal converter and a signal converting method which can
produce a high resolution video signal.

Various changes and applications are conceivable without departing from the spirit of the present invention.
Therefore, the present invention is not limited by or to the described embodiments.

While the invention has been described in connection with the preferred embodiments, it will be obvious to those
skilled in the art that various changes and modifications may be made; it is aimed, therefore, to cover in the
appended claims all such changes and modifications as fall within the true scope of the invention.

Claims 

1. A signal converting apparatus for converting an inputted first video signal into a second video signal different
from the first video signal, comprising:

    means for evaluating an intra-space activity for said first video signal to generate an activity code;
    means for executing stepwise classifications based on said activity code to generate a class code based on
    the result of said classification;
    a prediction coefficient memory stored with prediction coefficients for predictively producing said second
    video signal using said first video signal; and
    means for predictively calculating said first inputted video signal using said prediction coefficient read
    from said prediction coefficient memory according to said activity code and/or said class code to produce
    said second video signal.

2. The signal converting apparatus according to claim 1, wherein said first video signal is a low resolution video
signal, and said second video signal is a high resolution video signal which is higher in resolution than said low
resolution video signal.

3. The signal converting apparatus according to claim 1, wherein said second video signal is a video signal which has
a greater number of pixels than said first video signal.

4. The signal converting apparatus according to claim 1, wherein said activity code producing means evaluates an
intra-space activity and a temporal direction activity for said first video signal to generate an activity code. 

5. The signal converting apparatus according to claim 1, wherein said class code producing means sets a plurality of
different pixel patterns for said first video signal to select a pixel pattern from the set plurality of pixel patterns in
accordance with said activity code, and classifies said first video signal using the selected pixel pattern in order to
generate a class code.

6. The signal converting apparatus according to claim 1, wherein said activity code producing means evaluates an
intra-space activity using a dynamic range of each pixel in a neighboring region of a remarked pixel in said first
video signal in order to generate an activity code.

7. The signal converting apparatus according to claim 1, wherein said activity code producing means evaluates an
intra-space activity in accordance with a level distribution of quantized values obtained based on a dynamic range
defined by pixels in a neighboring region of a remarked pixel in said first video signal in order to generate an activity
code.

8. The signal converting apparatus according to claim 1, wherein said activity code producing means evaluates an
intra-space activity using a standard deviation obtained from a signal distribution of each pixel in a neighboring region
of a remarked pixel in said first video signal in order to generate an activity code.

9. The signal converting apparatus according to claim 1, wherein said activity code producing means evaluates an
intra-space activity in accordance with a frequency distribution of quantized values obtained based on a dynamic
range defined by pixels in a neighboring region of a remarked pixel in said first video signal in order to generate an
activity code.

10. The signal converting apparatus according to claim 1, wherein said activity code producing means evaluates an
intra-space activity in accordance with a frequency distribution of differences of respective adjacent pixel values for
each pixel in a neighboring region of a remarked pixel in said first video signal in order to generate an activity code.

11. The signal converting apparatus according to claim 1, wherein said activity code producing means evaluates an
intra-space activity based on Laplacian values obtained in respective different intra-space directions using Laplacian
filters in order to generate an activity code.

12. The signal converting apparatus according to claim 1, wherein said activity code producing means evaluates an
intra-space activity and a temporal direction activity based on Laplacian values obtained in respective different
intra-space and temporal directions using Laplacian filters in order to generate an activity code.

13. The signal converting apparatus according to claim 1, wherein said activity code producing means evaluates an
intra-space activity in accordance with a level distribution of each pixel in a neighboring region of a remarked pixel in
said first video signal in order to generate an activity code.

14. The signal converting apparatus according to claim 1, wherein said class code producing means classifies said first
video signal by quantizing said first video signal on the basis of a dynamic range defined by each pixel in a
neighboring region of a remarked pixel in said first video signal in order to generate a class code.

15. The signal converting apparatus according to claim 14, wherein said class code producing means adaptively
changes a level resolution capability when quantizing said first video signal in accordance with said activity
code.

16. The signal converting apparatus according to claim 4, wherein said class code producing means sets a pixel
pattern for a wide region, a pixel pattern for a narrow region as compared with said wide region, and a standard pixel
pattern corresponding to a region between said wide region and said narrow region, for said first video signal.

17. The signal converting apparatus according to claim 1, wherein:

said activity code producing means quantizes each pixel in a neighboring region of a remarked pixel in said
first video signal to compress data and evaluates an intra-space activity in accordance with characteristics of a level
distribution for the quantized value in order to produce said activity code; and

said class code producing means sets an adaptive pixel pattern in accordance with said activity code, and
sets a predetermined number of bits for each pixel of the pixel pattern in accordance with said activity code to
quantize it.

18. A signal converting apparatus for converting an inputted first video signal into a second video signal different from 
the first video signal, comprising: 

means for evaluating an intra-space activity for said first video signal to generate an activity code;

means for performing stepwise classification on the basis of said activity code to generate a class code on the
basis of the result of the classification; and

means, having a prediction value storing memory stored with a prediction value generated as an interpolated 
pixel signal for said first video signal, for reading and outputting a prediction value corresponding to said activity 
code and/or said class code. 

19. The signal converting apparatus according to claim 18, wherein said first video signal is a low resolution video
signal, and said second video signal is a high resolution video signal which is higher resolution than said low resolution
video signal.

20. The signal converting apparatus according to claim 18, wherein said second video signal is a video signal which
has a greater number of pixels than said first video signal.

21. The video signal converting apparatus according to claim 18, wherein said activity code producing means
evaluates an intra-space activity and a temporal direction activity for said first video signal in order to generate an activity
code.

22. The video signal converting apparatus according to claim 18, wherein said class code producing means sets a
plurality of different pixel patterns for said first video signal to select a pixel pattern from the set plurality of pixel
patterns in accordance with said activity code, and classifies said first video signal using the selected pixel pattern in
order to generate a class code.

23. A video signal converting method for converting an inputted first video signal into a second video signal different from
the first video signal, comprising the steps of:

evaluating an intra-space activity for said first video signal to generate an activity code;

performing stepwise classification based on said activity code to generate a class code based on the result of
the classification;

reading a prediction coefficient stored in a prediction coefficient memory for predictively producing said second 
video signal, using said first video signal in accordance with said activity code and/or said class code; 
performing a prediction calculation for said first inputted video signal using the read prediction coefficient; and 
outputting a prediction calculation value as said second video signal. 

24. The signal converting method according to claim 23, wherein said first video signal is a low resolution video signal,
and said second video signal is a high resolution video signal which is higher resolution than said low resolution
video signal.

25. The signal converting method according to claim 23, wherein said second video signal is a video signal which has
a greater number of pixels than said first video signal.

26. The signal converting method according to claim 23, wherein said activity code producing step evaluates an
intra-space activity and a temporal direction activity for said first video signal.



27. The signal converting method according to claim 23, wherein said class code producing step sets a plurality of
different pixel patterns for said first video signal to select a pixel pattern from the set plurality of pixel patterns in
accordance with said activity code and classifies said first video signal using the selected pixel pattern.

28. The signal converting method according to claim 23, wherein said activity code producing step evaluates an
intra-space activity using a dynamic range for each pixel in a neighboring region of a remarked pixel in said first video
signal.

29. The signal converting method according to claim 23, wherein said activity code producing step evaluates an
intra-space activity in accordance with a level distribution for a quantized value obtained based on a dynamic range
defined by pixels in a neighboring region of a remarked pixel in said first video signal.

30. The signal converting method according to claim 23, wherein said activity code producing step evaluates an
intra-space activity using a standard deviation obtained from a signal distribution of each pixel in a neighboring region of
a remarked pixel in said first video signal.

31. The signal converting method according to claim 23, wherein said activity code producing step evaluates an
intra-space activity in accordance with a frequency distribution for a quantized value obtained based on a dynamic range
defined by pixels in a neighboring region of a remarked pixel in said first video signal.

32. The signal converting method according to claim 23, wherein said activity code producing step evaluates an
intra-space activity in accordance with a frequency distribution of differences of respective adjacent pixel values for each
pixel in a neighboring region of a remarked pixel in said first video signal.

33. The signal converting method according to claim 23, wherein said activity code producing step evaluates an
intra-space activity based on Laplacian values obtained in respective different intra-space directions using Laplacian
filters.

34. The signal converting method according to claim 26, wherein said activity code producing step evaluates an
intra-space activity and a temporal direction activity based on Laplacian values obtained in respective different
intra-space and temporal directions using Laplacian filters.

35. The signal converting method according to claim 23, wherein said activity code producing step evaluates an
intra-space activity in accordance with a level distribution in a neighboring region of a remarked pixel in said first video
signal.

36. The signal converting method according to claim 23, wherein said class code producing step classifies said first
video signal by quantizing said first video signal based on a dynamic range defined by each pixel in a neighboring
region of a remarked pixel in said first video signal.

37. The signal converting method according to claim 36, wherein said class code producing step changes a level
resolution capability when quantizing said first video signal in accordance with said activity code.

38. The signal converting method according to claim 26, wherein said class code producing step sets a pixel pattern
for a wide region, a pixel pattern for a narrow region as compared with said wide region, and a standard pixel
pattern corresponding to a region between said wide region and said narrow region, for said first video signal.

39. The signal converting method according to claim 26, wherein:

said activity code producing step quantizes each pixel in a neighboring region of a remarked pixel in said first
video signal to compress data and evaluates an intra-space activity in accordance with characteristics of a level
distribution for the quantized value; and

said class code producing step sets an adaptive pixel pattern in accordance with said activity code, and sets
a predetermined number of bits for each pixel of the pixel pattern in accordance with said activity code to quantize it.

40. A signal converting method for converting an inputted first video signal into a second video signal different from the
first video signal, comprising the steps of:

evaluating an intra-space activity for said first video signal and outputting an activity code;

performing stepwise classifications based on said activity code and outputting a class code based on a result
of the classification;

reading a prediction value stored in a prediction value memory in accordance with said activity code and/or
said class code; and

outputting said prediction value produced as an interpolated pixel signal for said first video signal.

41. The signal converting method according to claim 40, wherein said first video signal is a low resolution video
signal, and said second video signal is a high resolution video signal which is higher resolution than said low resolution
video signal.

42. The signal converting method according to claim 40, wherein said second video signal is a video signal which has
a greater number of pixels than said first video signal.

43. The signal converting method according to claim 40, wherein said activity code producing step evaluates an
intra-space activity and a temporal direction activity for said first video signal.

44. The signal converting method according to claim 40, wherein said activity code producing step sets a plurality of
different pixel patterns for said first video signal, selects a pixel pattern from the set plurality of pixel patterns in
accordance with said activity code, and classifies said first video signal using the selected pixel pattern.
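
For illustration only, the selection of a pixel pattern in accordance with an activity evaluated by Laplacian filters, as recited in several of the claims above, might be sketched as follows. The tap layouts, the thresholds, the use of a one-dimensional second difference as the directional Laplacian, and the assignment of the wide pattern to low activity are all assumptions made for this example; the claims do not fix any of them.

```python
from typing import List, Tuple

Image = List[List[int]]

# Hypothetical tap layouts, given as (row, column) offsets from the remarked pixel.
PATTERNS = {
    "wide":     [(-2, 0), (0, -2), (0, 0), (0, 2), (2, 0)],
    "standard": [(-1, 0), (0, -1), (0, 0), (0, 1), (1, 0)],
    "narrow":   [(0, -1), (0, 0), (0, 1)],
}

def pixel(img: Image, r: int, c: int) -> int:
    """Read a pixel, clamping the coordinates at the image border."""
    r = min(max(r, 0), len(img) - 1)
    c = min(max(c, 0), len(img[0]) - 1)
    return img[r][c]

def directional_laplacians(img: Image, r: int, c: int) -> List[int]:
    """Absolute second difference along the horizontal, vertical and two diagonal directions."""
    centre = pixel(img, r, c)
    directions = [(0, 1), (1, 0), (1, 1), (1, -1)]
    return [abs(pixel(img, r - dr, c - dc) - 2 * centre + pixel(img, r + dr, c + dc))
            for dr, dc in directions]

def select_pattern(img: Image, r: int, c: int, low: int = 8, high: int = 32) -> str:
    """Choose a pattern name from the largest directional Laplacian (assumed thresholds)."""
    peak = max(directional_laplacians(img, r, c))
    if peak < low:
        return "wide"
    if peak < high:
        return "standard"
    return "narrow"

def taps(img: Image, r: int, c: int) -> Tuple[str, List[int]]:
    """Return the selected pattern name and the pixel values it picks out."""
    name = select_pattern(img, r, c)
    return name, [pixel(img, r + dr, c + dc) for dr, dc in PATTERNS[name]]
```

The pixel values returned by `taps` would then be quantized to form the class code; that step is omitted here.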